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1 Introduction 

1.1 What is Direct Energy Conversion? 

Energy conversion devices convert between electrical, magnetic, kinetic, po¬ 
tential, optical, chemical, nuclear, and other forms of energy. Energy con¬ 
version processes occur naturally. Eor example, energy is converted from 
optical electromagnetic radiation to heat when sunlight warms a house, 
and energy is converted from potential energy to kinetic energy when a 
leaf falls from a tree. Alternatively, energy conversion devices are designed 
and manufactured by a wide range of scientists and engineers. These en¬ 
ergy conversion devices range from tiny integrated circuit components such 
as thermocouples which are used to sense temperature by converting mi¬ 
crowatts of power from thermal energy to electricity to enormous coal power 
plants which convert gigawatts of energy stored in the chemical bonds of 
coal into electricity. 

A direct energy conversion device converts one form of energy to an¬ 
other through a single process. Eor example, a solar cell is a direct energy 
conversion device that converts optical electromagnetic radiation to elec¬ 
tricity. While some of the sunlight that falls on a solar cell may heat it up 
instead, that effect is not fundamental to the solar cell operation. Alterna¬ 
tively, indirect energy conversion devices involve a series of direct energy 
conversion processes. Eor example, some solar power plants involve con¬ 
verting optical electromagnetic radiation to electricity by heating a fluid 
so that it evaporates. The evaporation and expansion of the gas spin a 
rotor of a turbine. The energy from the mechanical motion of the rotor is 
converted to a time varying magnetic held which is then converted to an 
alternating electrical current in the coils of the generator. 

This text focuses on direct energy conversion devices which convert be¬ 
tween electrical energy and another form. Because of the wide variety of 
devices that fit in this category, energy conversion is a topic important 
to all types of electrical engineers. Some electrical engineers specialize 
in building instrumentation systems. Many sensors used by these engi¬ 
neers are direct energy conversion devices, including strain gauges used to 
measure pressure. Hall effect sensors which measure magnetic held, and 
piezoelectric sensors used to detect mechanical vibrations. The electrical 
energy produced in a sensor may be so small that amplification is required. 
Other electrical engineers specialize in the production and distribution of 
electrical power. Batteries and solar cells are direct energy conversion de¬ 
vices used to store and generate electricity. They are particularly useful 
in remote locations or in hand held gadgets where there is no easy way 
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to connect to the power grid. Relatedly, direct energy conversion devices 
such as thermoelectric devices and fuel cells are used to power satellites, 
rovers, and other aerospace systems. Many electrical engineers work in 
the automotive industry. Direct energy conversion devices found in cars 
include batteries, optical cameras. Hall effect sensors in tachometer used 
to measure rotation speed, and pressure sensors. 

Direct energy conversion is a fascinating topic because it does not fit 
neatly into a single discipline. Energy conversion is fundamental to the 
fields of electrical engineering, but it is also fundamental to mechanical 
engineering, physics, chemistry, and other branches of science and engi¬ 
neering. For example, springs are energy storage devices often studied by 
mechanical engineers, capacitors are energy storage devices often studied 
by electrical engineers, and batteries are energy storage devices often stud¬ 
ied by chemists. Relatedly, energy storage and energy conversion devices, 
such as springs, capacitors, and batteries, are not esoteric. They are com¬ 
monplace, cheap, and widely available. While they are found in everyday 
objects, they are active subjects of contemporary research too. For exam¬ 
ple, laptop computers are limited by the lifetime of batteries, and cell phone 
reception is often limited by the quality of an antenna. Batteries, antennas, 
and other direct energy conversion devices are studied by both consumer 
companies trying to build better products and academic researchers trying 
to understand fundamental physics. 

1.2 Preview of Topics 

This book is intended to both illustrate individual energy conversion tech¬ 
nologies and illuminate the relationship between them. For this reason, it 
is organized in two parts. The first part is a survey of energy conversion 
processes. The second part introduces calculus of variations and uses it as 
a framework to relate energy conversion processes. 

Due to the wide variety, it is not possible to discuss all energy conversion 
devices, even all direct devices, in detail. However, by studying the example 
direct energy conversion processes, we can gain an understanding of indi¬ 
rect processes and other applications. The devices discussed in this book 
involve energy conversion between electrical form and another form. Addi¬ 
tionally, devices involving magnets and coils will not be discussed. Many 
useful devices, including motors, generators, wind turbines, and geothermal 
power plants, convert energy electromagnetically using magnets and coils. 
Approximately 90% of power supplied to the electrical grid in the United 
States comes from generators that use magnets and coils [1]. Also, about 
2/3 to 3/4 of energy used by manufacturing facilities goes towards motors 
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[2, ch. 1], However, plenty of good resources discussing these topics exist. 
Furthermore, this book emphasizes device that operate near room temper¬ 
ature and at relatively low power (<1 kW). Many interesting devices, such 
as nuclear power plants, operate at high temperatures. One reason not to 
discuss more powerful devices is that the vast majority of large electrical 
generators in use today involve turbines with coils and magnets. Another 
reason is that these devices are often limited by material considerations. 
Finding materials to construct high temperature devices is a challenging 
problem, but it is not the purpose of this book. Additionally, only technolo¬ 
gies commercially available on the market today are discussed in this book. 
Also, many quality texts exist on the topics of renewable and alternative 
energy sources. For this reason, this book will not focus on renewable or 
alternative energy technologies. Topics like wind turbines, which involve 
electromechanical energy conversion with magnets and coils, are not dis¬ 
cussed. Solar cells, piezoelectric devices, and other direct energy conversion 
devices are discussed and can be considered both direct energy conversion 
devices and renewable energy devices. 

While a few books on direct energy conversion exist, there are few 
things which set this book apart. First, many of the books on direct en¬ 
ergy conversion, including [3] and [4], are written at the graduate level 
while this book is aimed at a more general audience. This book is used 
for the course Direct Energy Conversion taught at Trine University, which 
is a junior undergraduate level course for electrical engineers. This book 
is not intended only for electrical engineering students. It is also aimed 
at researchers who are interested in how energy conversion is studied by 
scientists and engineers in other disciplines. The idea of energy conversion 
is fundamental to physics, chemistry, mechanical engineering, and multiple 
other disciplines. This book discusses fundamental physics behind energy 
conversion processes, introduces terminology used, and relates concepts of 
material science used for building devices. The chapters were written so 
that someone who is not an antenna designer, for example, can read the 
relevant chapter as an introduction and gain insights into some of the ter¬ 
minology and key concepts used by electromagnetics researchers. Second, 
a number of good books on the topic, including [3] and [5] were written 
decades ago. The concepts of these books remain relevant, and these books 
often predicted which technologies would be of interest. However, there is 
a need for a book which discusses the most accessible and commonplace 
direct energy conversion technologies in use today. Additionally, many of 
these classic texts are out of print, and contemporary texts are needed. 

The reader is assumed to be familiar with introductory chemistry and 
physics. Background in electrical circuits and materials may also be helpful. 
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Math through Calculus I is used in the first part of the book, and math 
through Calculus III (including partial derivatives) is used in the second 
part. Many topics in this text are discussed qualitatively. No attempt 
is made to be mathematically rigorous, and proofs are not given. The 
physics of devices is emphasized over excessive mathematics. Additionally, 
all physical systems will be discussed semiclassically, which means that 
explanations will involve electrons and electromagnetic fields, but the wave- 
particle duality of these quantities will not be discussed. While quantum 
mechanical, quantum field theoretical, and other more precise theories exist 
to describe many physical situations, semiclassical discussions will be used 
to make this book more easily accessible to readers without a background 
in quantum mechanics. 

Chapters 2-10 comprise the first part of this book. As mentioned 
above, they survey various direct energy conversion processes which convert 
to or from electricity and which do not involve magnets and coils. Table 
1.1 lists many of the processes studied along with where in the text they 
are discussed, and Table 1.2 lists some of the devices detailed. This text 
is not intended to be encyclopedic or complete. Instead, it is intended 
to highlight the physics behind some of the most widely available and 
accessible energy conversion devices which convert to or from electrical 
energy. One way to understand energy conversion devices used to convert 
to or from electricity is to classify them as most similar to capacitors, 
inductors, resistors, or diodes. While not all devices fit neatly in these 
categories, many do. The second column of Table 1.2 fists the category for 
various devices. Similarly, energy conversion processes may be capacitive, 
inductive, resistive, or diode-like. 

Capacitive energy conversion processes are discussed in Chapters 2 and 
3. Capacitors, piezoelectric devices, pyroelectric devices, and electro-optic 
devices are discussed. A piezoelectric device is a device which converts 
mechanical energy directly to electricity or converts electricity directly to 
mechanical energy [6] [3]. A material polarization and voltage develop 
when the piezoelectric device is compressed. A pyroelectric device converts 
a temperature differential into electricity [6]. The change in temperature 
induces a material polarization and a voltage in the material. Electro-optic 
devices convert an optical electromagnetic field to energy of a material po¬ 
larization. In these devices, an external optical field typically from a laser 
induces a material polarization and a voltage across the material. Chap¬ 
ters 4 and 5 discuss inductive energy conversion devices including antennas. 
Hall effect devices, and magnetohydrodynamic devices. An antenna con¬ 
verts electrical energy to an electromagnetic field or vice versa. A Hall 
effect device converts a magnetic field to or from electricity. A magnetohy- 
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Process 

Eorms of 

Energy 

Example 

Devices 

Discussed 

in 

Section 

Piezoelectricity 

Electricity 

Mechanical 

Energy 

Piezoelectric 

Vibration 

Sensor, Electret 
Microphone 

2.3 

Pyroelectricity 

Electricity 

t 

Heat 

Pyroelectric 

Infrared 

Detector 

3 

Electro-optic Effect 

Optical Electro¬ 
magnetic 

Energy 

Material 

Polarization 

Controllable 
Optics, Liquid 
Crystal Displays 

3.3 

Electromagnetic 
Transmission and 
Reception 

Electricity 

Electromagnetic 

Energy 

Antenna 

4 

Hall Effect 

Electricity 

Magnetic 

Energy 

Hall Effect 

Device 

5 

Magnetohydrody¬ 
namic Effect 

Electricity 

Magnetic 

Energy 

Magnetohydrody¬ 

namic 

Device 

5.3 

Absorption 

Optical Electro¬ 
magnetic 

Energy 

i 

Electricity 

Solar cell. 
Semiconductor 
Optical 
Photodetector 

6 


Table 1.1: Variety of energy conversion processes. 
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Process 

Eorms of 

Energy 

Example 

Devices 

Discussed 

in 

Section 

Spontaneous 

Emission 

Electricity 

1 

Optical Electro¬ 
magnetic 

Energy 

Lamp, LED 

7.3 

Stimulated 

Emission 

Electricity 

1 

Optical Electro¬ 
magnetic 

Energy 

Laser, Optical 
Amplifier 

7.4 

Thermoelectric 
Effects (Inch 
Seebeck, Peltier 
and Thomson) 

Electricity 

Heat 

Thermoelectric 
cooler, Peltier 
device. 

Thermocouple 

8.8 

(Battery or Enel 
Cell) Discharging 

Chemical 

Energy 

i 

Electricity 

Battery, Enel 

Cell 

9 

(Battery or Enel 
Cell) Charging 

Electricity 

1 

Chemical 

Energy 

Battery, Enel 

Cell 

9 

Thermionic 

Emission 

Heat 

1 

Electricity 

Thermionic 

Device 

10.2 

Electrohydrody¬ 
namic Effect 

Electricity 

t 

Eluid flow 

Microfluidic 
Pump, Valve 

10.6 


Table 1.1 continued: Variety of energy conversion processes. 




1 INTRODUCTION 


7 


Device 

Similar to 

Gompo- 

nent 

Eorms of Energy 

Discussed 

Section 

Piezoelectric 

Device 

Gapacitor 

Electricity 

Mechanical Energy 

2.3 

Pyroelectric Device 

Gapacitor 

Electricity 

Heat 

3 

Electro-optic 

Device 

Gapacitor 

Optical Energy 

Material Polarization 

3.3 

Antenna 

Inductor 

Electricity 

Electromagnetic 

4 

Hall Effect Device 

Inductor 

Electricity 

Magnetic Energy 

5 

Magnetohydrody¬ 
namic Device 

Inductor 

Electricity 

Magnetic Energy 

5.3 

Solar Cell 

Diode 

Optical Energy 

i 

Electricity 

6 

LED, Laser 

Diode 

Electricity 

i 

Optical Energy 

7 

Thermoelectric 

Device 

Diode 

Electricity 

Heat 

8.8 

Geiger Counter 

Diode 

Radiation 

i 

Electricity 

10.3 

Resistance Temp. 
Detector 

Resistor 

Heat 

f 

Electricity 

10.5 

Potentiometer 

Resistor 

Electricity 

i 

Heat 

10.5 

Strain Gauge 

Resistor 

Mechanical Energy 

i 

Electricity 

10.5 


Table 1.2: Variety of energy conversion devices. 
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drodynamic device converts kinetic energy of a conducting material in the 
presence of a magnetic field into electricity. 

Optical devices are discussed in Chapters 6 and 7. These chapters dis¬ 
cuss devices made from diode-like pn junctions such as solar cells, LEDs, 
and semiconductor lasers as well as other types of devices such as incandes¬ 
cent lamps and gas lasers. Thermoelectric devices convert a temperature 
differential into electricity [3, p. 146]. They are also made from junctions 
of materials in which heat and charges flow at different rates, and they are 
discussed in Chapter 8. Batteries and fuel cells are discussed in Chapter 9. 
A battery is a device which stores energy as a chemical potential. Batteries 
range in size from tiny hearing aid button sized batteries which store tens 
of milliamp-hours of charge to large car batteries which can store 10,000 
times as much energy. A fuel cell is a device which converts chemical en¬ 
ergy to electrical energy through the oxidation of a fuel [3]. During battery 
operation, the electrodes are consumed, and during fuel cell operation, the 
fuel and oxidizer are consumed instead. A variety of resistor-like energy 
conversion devices, among other devices, are discussed briefly in Chapter 
10. 

Chapters 11-14 comprise the second part of this book. These chapters 
are more theoretical, and they establish a mathematical framework for un¬ 
derstanding energy conversion. This mathematics allows relationships to 
be studied between energy conversion devices built by electrical engineers, 
mechanical engineers, chemists, and scientists of other disciplines. Chap¬ 
ters 11 and 12 introduce the idea of calculus of variations and apply it to 
a wide variety of energy conversion processes. Chapter 13 applies the idea 
of calculus of variations to energy conversion within an individual atom. 
Chapter 14 shows how a study of the symmetries of the equations pro¬ 
duced from calculus of variations can provide further insights into energy 
conversion processes. 
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1.3 Conservation of Energy 

Energy conservation is one of the most fundamental ideas in all of sci¬ 
ence and engineering. Energy can be converted from one form to another. 
Eor example, kinetic energy of a moving ball can be converted to heat by 
friction, or it can be converted to potential energy if the ball rolls up a 
hill. However, energy cannot be created or destroyed. The idea of energy 
conservation will be considered an axiom, and it will not be questioned 
throughout this book. Sometimes people use somewhat loose language 
when describing energy conversion. Eor example, one might say that en¬ 
ergy is lost to friction when a moving block slides along a table or when 
electricity flows through a resistor. In both cases, the energy is not lost 
but is instead converted to heat. Thermoelectric devices and pyroelectric 
devices can convert a temperature differential back to electricity. Someone 
might say that energy is generated by a coal power plant. What this phrase 
means is that chemical energy stored in the coal is converted to electrical 
energy. When a battery is charged, electrical energy is converted back to 
chemical energy. This imprecise language will occasionally be used in the 
text, but in all cases, energy conservation is assumed. While it might seem 
like an abstract theoretical law, energy conservation is used regularly by cir¬ 
cuit designers, mechanical engineers modeling mechanisms, civil engineers 
designing pipe systems, and other types of engineers. 

Efficiency of an energy conversion device, ^7e//, is defined as the power 
output of the desired energy type over the power input. 


Veff 


Pr 


out 


p. 

j- 17 


( 1 . 1 ) 


Efficiency may be written as a fraction or a percent. Eor example, if we say 
that an energy conversion device is 75% efficient, we mean that 75% of the 
energy is converted from the first form to the second while the remaining 
energy either remains in the first form or is converted to other undesired 
forms of energy. Energy conversion devices are rarely 100% efficient, and 
some commercial energy conversion devices are only a few percent efficient. 
Multiple related measures of efficiency exist where the input and output 
powers are chosen slightly differently. To accurately compare efficiency 
measures of devices, consistent of input and output power must be used. 
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1.4 Measures of Power and Energy 

This book brings together topics from a range of fields including chemistry, 
electrical engineering, and thermodynamics. Scientists in each branch of 
study use symbols to represent specific quantities, and the choice of vari¬ 
ables by scientists in one field often contradict the choice by scientists in 
another field. In this text, different fonts are used to represent different 
symbols. For example, S represents entropy, $ represents the Seebeck co¬ 
efficient, and S represents action. A fist of variables used in this text along 
with their units can be found in Appendix A. Use the tables in the appendix 
as tools. 

Power P and energy E are fundamental measures. Power absorbed by 
a system is the derivative of the energy absorbed with respect to time. 


P = 


dE 

dt 


( 1 . 2 ) 


In SI units, energy is measured in joules and power is measured in watts. 
While these are the most common measures, many other units are used. 
Every industry, from the petroleum industry to the food industry to the 
electrical power industry, seems to have its own favorite units. Tables 1.3 
and 1.4 list energy and power conversion factors. Values in the tables are 
from references [7] and [8]. 

Conversions between joules and some units, including calories, ergs, 
kilowatt hours, and tons of TNT are exact definitions [7]. The calorie 
is approximately the energy needed to increase the temperature of one 
gram of water by a temperature of one degree Celsius, but it is defined to 
be 4.1868 J [7]. Note that there is both a calorie and food calorie (also 
called kilocalorie). The food calorie or kilocalorie is typically used when 
specifying the energy content of foods, and it is a thousand times as large 
as the (lowercase c) calorie. Other conversions fisted in Table 1.3, including 
the conversion for energy in barrels of crude oil, are approximate average 
values instead of exact definitions [8]. Values in Table 1.3 are listed to 
the significant precision known or to four significant digits. Other inexact 
values throughout this text are also specified to four significant digits. The 
unit referred to as wave number, is discussed in Ch. 6. The conversion 
value listed in Table 1.3 for the therm is the US, not European, accepted 
value [7]. A barrel, used in the measure of crude oil, is 42 US gallons [8]. 
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1 J = 6.241508 ■ 10^® electron Volt, 
eV 

1 eV = 1.602176 ■ 10-i° J 

1 J = 10^ erg 

1 erg =10-'’ J 

1 J =0.7375621 foot pound-force 

1 foot pound-force =1.355818 J 

1 J = 0.23885 calories 

1 calorie =4.1868 J 

1 J = 9.47817 ■ 10“^ British thermal 
units, Btu 

1 Btu =1055.056 J 

1 J = 2.3885 ■ 10“"^ kilocalories 
(food calories) 

1 kilocalorie =4186.8 J 

1 J = 9.140 ■ 10“^ cubic feet of 
natural gas 

1 cubic foot of nat. gas 
=1.094-10° J 

1 J = 2.778 ■ 10“^ kilowatt hour, 
kW-h 

1 kW-h = 3.6 ■ 10° J 

1 J = 6.896 ■ 10“® gallons diesel fuel 

1 gallon diesel fuel =1.450 ■ 10® J 

1 J = 9.480434 ■ 10“® therm (US) 

1 therm (US) =1.054804 ■ 10® J 

1 J = 7.867 ■ 10“® gallons motor 
gasoline 

1 gallon motor gasoline 
=1.271 ■ 10® J 

1 J = 2.390 ■ 10-1° ton of TNT 

1 ton of TNT =4.184 ■ 10° J 

1 J = 1.658 ■ 10-1° barrels crude oil 

1 barrel crude oil =6.032 ■ 10° J 

1 J = 4.491 ■ 10-11 metric ton of 
coal 

1 metric ton coal =2.227 ■ 10^° J 

1 J = 1.986447 ■ 10-23 ^ 

1 ^ = 5.03411 ■ 1022 J 


Table 1.3: Energy unit conversion factors. 


1 W = 1 1 

1 1 = 1 W 

1 W = 1 ■ 10^ ^ 

1 ^ = 10-7 w 

1 W = 1.34 ■ 10-3 horsepower 

1 horsepower =7.46 ■ 102 W 

1 W = 2.655224 ■ 103 foot 
pound-force per h 

1 foot pound-force per h 
= 3.766161 ■ 10-^ W 


Table 1.4: Power unit conversion factors. 
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1.5 Properties of Materials 

1.5.1 Macroscopic Properties 

To understand energy conversion devices, we need to understand materi¬ 
als both microscopically on the atomic scale and macroscopically on large 
scales. A macroscopic property is a property that applies to large pieces of 
the material as opposed to microscopic sized pieces. 

One way to classify materials is based on their state of matter. Materials 
can be classified as solids, liquids, gases, or plasmas. A plasma is an ionized 
gas. Other more unusual states of matter exist such as Bose Einstein 
condensates, but they will not be discussed in this book. 



Crystalline Amorphous 


Polycrystalline 


Figure 1.1: Illustration of crystalline, amorphous, and polycrystalline 
atomic structure. 

We can further classify solids as crystalline, polycrystalline, or amor¬ 
phous based on the regularity of their atomic structure [9]. Figure 1.1 
illustrates these terms. In a crystal, the arrangement of atoms is periodic. 
The atoms may be arranged in a cubic array, hexagonal array, or some other 
way, but they are arranged periodically in three dimensions. In an amor¬ 
phous material, the arrangement of the atoms is not periodic. The term 
amorphous means glassy. A polycrystalline material is composed of small 
crystalline regions. These definitions can apply to materials made of single 
elements or materials made of multiple elements. Many energy conversion 
devices are made from very pure crystalline, amorphous, or polycrystalline 
materials. For example, amorphous cadmium telluride is used to make so¬ 
lar cells, and crystalline silicon is used to make Hall effect devices. Many 
materials, including both silicon and silicon dioxide, can be found in all 
three of these forms at room temperature. In crystalline and amorphous 
silicon, for example, the silicon atoms may have the same number of near¬ 
est neighbors, and the density of atoms in both materials may be the same, 
but there is no medium-range order in the amorphous material. Electrical 
properties of crystalline, amorphous, and polycrystalline forms of a mate- 











1 INTRODUCTION 


13 


rial may differ. Electrons can flow more easily through a pure crystalline 
material while electrons are more likely to be scattered or absorbed as they 
flow through an amorphous material, crystalline materials with impurities, 
or a crystalline material with crystal defects. 

We can further classify crystals as either isotropic or anisotropic [10, 
p. 210]. A crystal is isotropic if its macroscopic structure and material 
properties are the same in each direction. A crystal is anisotropic if the 
macroscopic structure and material properties are different in different di¬ 
rections. 

We can also classify materials based on how they behave when a voltage 
is applied across the material [11]. In a conductor, electrons flow easily 
in the presence of an applied voltage or electric held. In an insulator, 
also called a dielectric, electrons do not flow in the presence of an applied 
voltage or electric held. In the presence of a small external voltage or 
electric held, a semiconductor acts as an insulator, and in the presence of a 
strong voltage or electric held, a semiconductor acts as a conductor. Both 
solids and liquids can be conductors, and both solids and liquids can be 
insulators. For example, copper is a solid conductor while salt water is a 
liquid conductor. 

1,5.2 Microscopic Properties 

The electron configuration lists the energy levels occupied by electrons 
around an atom. The electron configuration can describe neutral or ion¬ 
ized atoms, and it can describe atoms in the lowest energy state or excited 
atoms. For example, the electron configuration of a neutral aluminum atom 
in the lowest energy state is ls^2s^2p®3s^3pb The electron configuration 
of an aluminum Al"^ ion in the lowest energy state is ls^2s^2p®3s^, and the 
electron configuration of a neutral aluminum atom with an excited electron 
can be written as ls^2s^2p®3s^4s^. 

Electrons are labeled by four quantum numbers: the principle quantum 
number, the azimuthal quantum number, the magnetic quantum number, 
and the spin quantum number [6] [12]. No two electrons around an atom 
can have the same set of quantum numbers. The principle quantum number 
takes integer values, 1, 2, 3 and so on. All electrons with the same principle 
quantum number are said to be in the same shell. The large numbers in 
the electron configuration refer to principle quantum numbers. The neutral 
aluminum atom in the lowest energy state has two electrons in the 1 shell, 
eight electrons in the 2 shell, and three electrons in the 3 shell. For most 
atoms, especially atoms with few electrons, electrons with lower principle 
quantum numbers both are spatially closer to the nucleus and require the 
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most energy to remove. However, there are exceptions to this idea for some 
electrons around larger atoms [13] [14]. 

Azimuthal quantum numbers are integers, and these values define sub- 
shells. For shells with principle quantum number n , the azimuthal quantum 
number can take values from 0 to n—1. In the electron configuration, values 
of this quantum number are denoted by lowercase letters: s=0, p=l, d=2, 
f=3, and so on. Magnetic quantum numbers are also integers, and these 
values define orbitals. For a subshell with azimuthal quantum number /, 
the magnetic quantum number takes values from —I to 1. In the electron 
configuration, superscript numbers indicate the magnetic quantum num¬ 
ber. Spin quantum numbers of electrons can take the values | and 
They are not explicitly denoted in the electron configuration. 

Consider again the neutral aluminum atom in the lowest energy state. 
This atom has electrons with principle quantum numbers n=l, 2, and 3. 
For electrons with principle quantum number 1, the only possible values for 
both the azimuthal quantum number and the magnetic quantum number 
are zero. The spin quantum number can take the values of ^ and Only 
two electrons can occupy the 1 shell, and these electrons are denoted by 
the Is^ term of the electron configuration. For the electrons with principle 
quantum number 2, the azimuthal quantum number can be 0 or 1. Two 
electrons can occupy the 2s orbital, and six electrons can occupy the 2p 
orbital. For the 3 shell, the azimuthal quantum number can take three 
possible values: s=0, p=l, and d=2. However since aluminum only has 13 
electrons, electrons do not have all of these possible values, so the 3 shell 
is only partially filled. The atoms in the rightmost column of the periodic 
table have completely filled shells. They are rarely involved in chemical 
reactions because adding electrons, removing electrons, or forming chemical 
bonds would require too much energy. 

Valence electrons are the electrons that are most easily ripped off an 
atom. Valence electrons are the electrons involved in chemical reactions, 
and electrical current is the flow of valence electrons. Other, inner shell, 
electrons may be involved in chemical reactions or electrical current only in 
cases of unusually large applied energies, and these situations will not be 
discussed in this text. Valence electrons occupy the subshell or subshells 
with the highest quantum numbers, and valence electrons are not part of 
completely filled shells. For the example of the neutral aluminum atom 
in the lowest energy state, the three electrons in the 3 shell are valence 
electrons. 

Where are the electrons around the atom spatially? This question is 
of interest to chemists, physicists, and electrical engineers. If we know the 
orbital of an electron, we have some information on where the electron 
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is likely to be found spatially around an atom. However, identifying the 
location of an electron with any degree of precision is difficult for multiple 
reasons. First, atoms are tiny, roughly 10“^° m in diameter. Second, at 
any temperature above absolute zero, atoms and electrons are continually 
in motion. Third, electrons have both particle-like and wave-like properties. 
Fourth, according to Heisenberg’s Uncertainty Principle, the position and 
momentum of an electron cannot simultaneously be known with complete 
precision. At best, you can say that an electron is most likely in some 
region and moves with some range of speed. Fifth, in many materials 
including conductors and semiconductors, valence electrons are shared by 
many atoms instead of bound to an individual atom [10, p. 544]. 


1.6 Electromagnetic Waves 

1.6.1 Maxwell’s Equations 

In this text, V and / denote DC voltage and current respectively while v 
and i denote AC or time varying voltage and current. In circuit analysis, 
we are unconcerned with what happens outside these wires. We are only 
interested in node voltages and currents through wires. Furthermore, the 
voltages and currents in the circuit are described as functions of time t 
but not position {x,y,z). Devices like resistors, capacitors, and inductors 
too are assumed to be point-like and not extended with respect to position 
{x, y, z). This set of assumptions is just a model. In reality, if two nodes in 
a circuit have a voltage difference between them, then necessarily a force 
is exerted on nearby charges not in the path of the circuit. This force per 
unit charge is the electric field intensity Similarly, if there is current 
flowing through a wire, there is necessarily a force exerted on electrons in 
nearby loops of wire, and this force per unit current element is the magnetic 
flux density ^. Energy can be stored in an electric or magnetic field. In 
later chapters, we will discuss devices, including antennas, electro-optic 
devices, photovoltaic devices, lamps, and lasers, that convert energy of an 
electromagnetic field to or from electricity. 

Four interrelated vector quantities are used to describe electromagnetic 
fields. These vector fields are functions of position {x, y, z) and time t. The 
four vector fields are 



x, r/, 2 :, t)=Electric field intensity in 


V 

m 



x, I/, 2 :, t)=Displacement flux density in 


m2 
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x, r/, t)=Magnetic field intensity in 


A 

m 



x, y, t)=Magnetic fiux density in 


Wb 

m2 


In these expressions, V represents the units volts, C represents the units 
coulombs, A represents the units amperes, and Wb represents the units 
webers. Additional abbreviations for units are listed in Appendix B. 
Coulomb’s law 

_ QlQ2^r /w o\ 

“ 47rer2 ^ ' '' 


tells us that charged objects exert forces on other charged objects. In this 
expression, Qi and Q 2 are the magnitude of the charges in coulombs. The 
quantity e is the permittivity of the surrounding material in units farads 
per meter, and it is discussed further in Sections 1.6.3 and 2.2.3. The 
quantity r is the distance between the charges in meters, and is a unit 
vector pointing along the direction between the charges. Force in newtons 
is represented by F. Opposite charges attract, and like charges repel. 
Electric field intensity is force per unit charge, so the electric field intensity 
due to a point charge is given by 




Qdr 

47rer2 


(1.4) 


These vector fields can describe forces on charges or currents in a circuit 
as well as outside the path of a circuit. Maxwell’s equations relate time 
varying electric and magnetic fields. Maxwell’s equations in differential 
form are: 






Faraday’s Law 






Ampere’s Law 


( 1 . 6 ) 

( 1 . 6 ) 


y.3 = pch Gauss’s Law for the Electric Field (1.7) 

Gauss’s Law for the Magnetic Field (1.8) 

The additional quantities in Maxwell’s equations are the volume current 
density J in and the charge density pch in In this text, we will not 
be solving Maxwell’s equations, but we will encounter references to them. 
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The quantity ^ is called the del operator. In Cartesian coordinates, it 
is given by 

^ d ^ d ^ d 

ax ay az 

When this operator acts on a scalar function, ^/, it is called the gradient. 
The gradient of a scalar function returns a vector representing the spatial 
derivative of the function, and it points in the direction of largest change in 
that function. In Maxwell’s equations, ^ acts on vector, instead of scalar, 
functions. The operation ^ x ^ is called the eurl, and the operation ^ ^ 
is called the divergenee. Both of these operations represent types of spatial 
derivatives of vector functions. The del operator obeys the identity 

V2 = ^.^. (1.10) 


The operation V^/ is called the Laplaeian of a scalar function, and it 
represents the spatial second derivative of that function. 


1,6.2 Electromagnetic Waves in Free Space 

Electromagnetic waves travel through empty space at the speed of light in 
free space, c = 2.998 ■ 10® and through other materials at speeds less 
than c. For a sinusoidal electromagnetic wave, the speed of propagation is 
the product of the frequency and wavelength 

|7 = /A (1.11) 

where |^| is the magnitude of the velocity in / is the frequency in Hz, 
and A is the wavelength in meters. In free space, Eq. 1.12 becomes 

c=/A. (1.12) 

The speed of light in free space is related to two constants which describe 
free space. 

c = (1-13) 

The permittivity of free spaee is given by eo = 8.854 ■ 10“^^ ^ where 
F represents farads, and the permeability of free spaee is given by /xq = 
1.257 ■ 10“® ^ where H represents henries. (Constants specified in this 
section and in Appendix A are rounded to four significant digits.) 

In free space, the electric field intensity ^ and the displacement flux 
density ^ are related by eo. 


^ = €o^ 


( 1 . 14 ) 
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Relatedly in free space, the magnetic field intensity n and the magnetic 
fiux density ^ are related by /xq- 





(1.15) 


1,6.3 Electromagnetic Waves in Materials 

Electromagnetic fields interact very differently with conductors and with 
insulators. Electromagnetic fields do not propagate into perfect conduc¬ 
tors. Instead, charges and currents accumulate on the surface. While no 
materials are perfect conductors, commonly encountered metals like cop¬ 
per and aluminum are very good conductors. When these materials are 
placed in an external electromagnetic field, surface charges and currents 
build up, and the electromagnetic field in the material quickly approaches 
zero. Electromagnetic fields propagate through perfect insulators for long 
distances without decaying, and no charges or currents can accumulate 
on the surface because there are no electrons free from their atoms. In 
practical dielectrics, electromagnetic waves propagate long distances with 
very little attenuation. Eor example, optical electromagnetic waves remain 
strong enough to detect after propagating hundreds of kilometers through 
optical fibers made of pure silicon dioxide [10, p. 886]. 

Resistance R in ohms, capacitance C in farads, and inductance L in 
henries describe the electrical properties of devices. Resistivity p in fim, 

rp u 

permittivity e in jjj, and permeability /i in describe the electrical prop¬ 
erties of materials. The quantities p, e, and p describe properties of materi¬ 
als alone while the quantities R, C, and L incorporate effects the material, 
shape, and size of a device. 

Resistivity p is a measure of the inability of charges or electromagnetic 
waves to propagate through a material. Conductors have a very small 
resistivity while insulators have a large resistivity. Sometimes electrical 
conductivity^ a = Mn units is used in place of the resistivity. Eor a 
device made of a uniform material with length I and cross sectional area 
A, resistance and resistivity are related by 

(1,16) 

Resistance is a measure of the inability of charges or electromagnetic waves 
to flow through a device while resistivity is a measure of the inability to 
flow through a material. 
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Parallel Plate Capacitor Partial Turn Inductor 

i ■ i ■ 



Figure 1.2: Geometry of a parallel plate capacitor and partial turn inductor. 


Permeability /i is a measure of the ability of a material to store energy 
in the magnetic field due to currents distributed throughout the material. 
Materials can also be described by their relative permeability a unitless 
measure. 



(1,17) 


While permeability describes a material, inductance describes a deviee. The 
magnetic flux density in a material is a scaled version of the magnetic field 
intensity. 

^ = fjJ^ (1T8) 


Often insulators have permeabilities close to /io while conductors used to 
make permanent magnets have significantly larger permeabilities. The right 
part of Fig. 1.2 shows a partial turn coil in a vacuum with length I, thickness 
dthick, and width w. The inductance and permeability of this device are 
related by [11, p. 311] 

(1.19) 

w 

Permittivity e is a measure of the ability of a material to store energy 
as an electric field due to charge separation distributed throughout the 
material. Materials can also be described by their relative permittivity e^, 
a unitless measure. 



( 1 , 20 ) 


The displacement flux density in a material is a scaled version of the electric 
field intensity. 

= ( 1 . 21 ) 


Some insulators have a permittivity hundreds of times larger than the per¬ 
mittivity of free space. Permittivity is a measure of ability to store energy 
in a material while capacitance is a measure of the ability to store energy 
in in a deviee. 
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A uniform parallel plate capacitor with cross sectional area of plates 
A = I ■ w and distance between plates dthick, is shown on the left part of 
Fig. 1.2, and it has capacitance 


C=— (1.22) 

^thick 

where e is the permittivity of the insulator between the plates. 

Permittivity, permeability, and resistivity, depend on frequency. In 
some contexts, the frequency dependence can be ignored, and through¬ 
out most of this text, these quantities will be assumed to be constants. 
In other contexts, the frequency dependence can be quite significant. For 
example, the permittivity of semiconductor materials is a strong function 
of frequency for frequencies close to the semiconductor energy gap. The 
permittivity e{u) and resistivity p{u) are not independent. If one of them is 
known as a function of frequency and p, is assumed constant, the other can 
be derived. This relationship is known as the Kramers Kronig relationship 
[10] or occasionally as the dielectric dispersion formula [15]. 

When discussing electrical properties of a device, resistance, inductance, 
and capacitance are combined into one complex measure, the impedance. 
Similarly, some authors find it convenient to combine resistivity, permit¬ 
tivity, and permeability into a pair of complex measures of the electrical 
properties of materials [6]. The complex permittivity is defined e* = e + jp, 
and the complex permeability is defined p* = p + jpmag ■ The quantity 
p represents the resistivity which is a measure of the energy converted to 
heat as a charge flows through a material due to an applied electrical field. 
The quantity p^ag represents an analogous measure of energy converted to 
heat from currents in an applied magnetic field. Complex permittivity and 
permeability will not be used in this text. 
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1.7 Problems 


1.1. A Ford Focus produces 160 horsepower [16]. Calculate the power 
produced in watts, and calculate the approximate energy produced 
by the vehicle in one hour. 

1.2. A gallon of gas contains 1.21 ■ 10® Btu and weighs 6 pounds [8]. 
Calculate the energy stored in the gallon of gas in joules, and calculate 
the specific energy in joules per kilogram. 

1.3. An Oreo cookie has 53 food calories and weighs 11 grams [17]. A ton 
(2000 pounds) of TNT contains approximately 4.184-10® J of energy 
[7]. Calculate the specific energy of the cookie in joules per kilogram, 
and calculate the specific energy of the TNT in joules per kilogram. 
(Yes, the value for the cookie is higher.) 

1.4. Find the electron configuration of an isolated indium atom in the 
lowest energy state. How many electrons are found around the atom? 
Repeat for a CU ion. 

1.5. Use a periodic table for this problem. 

(a) Which element has the electron configuration 
Is22s22p®3s23p®4s23d2? 

(b) List two elements which have exactly two valence electrons. 
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Part I 

Survey of Energy Conversion 
Devices 

2 Capacitors and Piezoelectric Devices 

2.1 Introduction 

This chapter begins with a discussion of material polarization, and then it 
discusses capacitors and piezoelectric devices. The next chapter discusses 
pyroelectric devices and electro-optic devices. All of these devices are all 
constructed from a thin dielectric layer, and operation of all of these devices 
involves establishing a material polarization, charge build up, throughout 
this dielectric material. In piezoelectric materials, mechanical strain causes 
a material polarization. As with many energy conversion devices, piezoelec¬ 
tric devices can work both ways, converting mechanical energy to electricity 
or converting electricity to mechanical vibrations. In pyroelectric devices, a 
temperature gradient causes the material polarization, and in electro-optic 
devices, an external optical electric field causes the material polarization. 

Why start the discussion of energy conversion devices with a discussion 
of capacitors? Capacitors are familiar to electrical engineers, and they are 
energy storage devices. How do capacitors work? What are the components 
of a capacitor? What materials are capacitors made out of? What are the 
differences between different types of capacitors such as mica capacitors 
and electrolytic capacitors? In an introductory circuits course, a capacitor 
is a device where the relationship between the current i and voltage v is 
given by 


i = 


C? 

dt 


( 2 . 1 ) 


and the capacitance C is just a constant. The only difference between one 
capacitor and another is the capacitance value. In order to answer these 
questions further, we need to go beyond this model. Through this study, 
we will gain insights into piezoelectric devices, pyroelectric devices, and 
electro-optic devices too. 
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2.2 Capacitors 

2.2.1 Material Polarization 

When an external voltage is applied across an insulator, charges separate 
throughout the material, and this charge separation is called a material 
polarization. Material polarization can be defined more precisely in terms 
of the electric field intensity ^ and the displacement fiux density Z^, two 
vector fields which show up in Maxwell’s equations, Eqs. 1.5 - 1.8. These 
vector fields are related by 

^ = e^. (2.2) 

Why do we define two electric field parameters when they are just scaled 
versions of each other? It is useful to separate the description of the elec¬ 
tric field inside a material from the description of the field in free space. 
Similarly, two vector fields describe magnetic field, the magnetic field inten¬ 
sity ^ and magnetic fiux density^, and these fields show up in Maxwell’s 
equations for the same reason. Material polarization, in units is 

defined as the difference between the electric field in the material D and 
the electric field that would be present in free space More specifically, 

^ ^ - eo^ (2.3) 

or 

= (e - eo) (2.4) 

These expressions involve the permittivity of free space eo and the permit¬ 
tivity of a material e which were defined in Sec. 1.6.3. 

Scientists overload both the words capacitance and polarization with 
multiple meanings. See Appendix C for more details on the different uses 
of these terms. 

2.2.2 Energy Storage in Capacitors 

When a capacitor is charged, energy is converted from electrical energy to 
energy stored in a material polarization which is energy of the charge sep¬ 
aration. When it is discharged, energy is converted from energy stored in 
the material polarization back to electrical energy of flowing electrons. Ca¬ 
pacitors are made from an insulating material between conducting plates. 
As we supply a voltage across the insulator, charges accumulate on the 
plates. The voltage built up is proportional to the charge accumulated on 
the plates. 


Q = Cv 


( 2 . 6 ) 
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In Eq. 2.5, Q is the charge in coulombs, v is the voltage, and the constant of 
proportionality is the capacitance C in farads. If we take the derivative with 
respect to time, we get the more familiar expression relating the current 
and voltage across the capacitor. 


dO . ^dv 
— = i = C — 
at at 


( 2 . 6 ) 


The capacitance of a capacitor is related to the permittivity of the 
dielectric material between the conductors. Permittivity is a measure of the 
amount of energy that can be stored by a dielectric material. As described 
by Eq. 1.22, for a parallel plate capacitor this relationship is 


C 


eA 

1 

thick 


(2.7) 


where A is the area of the plates and dthick is the distance between the 
plates. The energy E stored in a capacitor as a function of voltage applied 
across it is given by 

E = = ^Qv. (2.8) 

The capacitance of a vacuum-filled parallel plate capacitor is described by 
Eq. 2.7 with permittivity e = eo, the permittivity of free space. As we 
charge the capacitor, charges accumulate on the plates, and no change 
occurs to the vacuum between the plates. If we replace the vacuum with 
a dielectric with e > eo, the capacitance becomes larger. The dielectric 
filled capacitor can store more energy, all else equal, because the dielectric 
material changes as the capacitor charges. More specifically, the material 
polarizes. In an insulator, electrons are bound to their atoms, and current 
cannot flow. Instead, the electrons in a dielectric move slightly with respect 
to their nuclei while still staying bound to the atoms. Electrons are always 
in motion for materials at temperatures above absolute zero, but when a 
material polarizes, the net location of electrons with respect to the nuclei 
changes. As the capacitor charges, the electrons are slightly displaced from 
their atoms, balancing the charges on the plates, and more energy is stored 
in the dielectric for a given voltage. We say that this process induces 
electric dipoles. The larger the permittivity, e, the more the material can 
store energy by polarizing in this way. Eor this reason, capacitors are 
often filled with dielectric materials like tantalum dioxide Ta 205 which has 
e = 25eo [18]. A material with e = 25eo, for example, will be able to store 
25 times the energy of an air filled capacitor of the same size with the same 
applied voltage. 
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2,2.3 Permittivity and Related Measures 

For historical reasons, the permittivity may be expressed by different mea¬ 
sures. The electric susceptibility Xe, relative permittivity e^, index of re¬ 
fraction n, and permittivity e all describe the ability of a material to store 
energy in the electric field. Electric susceptibility is a unitless measure 
related to the permittivity by 


Xe = - - 1 (2.9) 

Co 

and relative permittivity is another unitless measure defined by 

= -. ( 2 . 10 ) 

Co 

With some algebra, we can write the material polarization in terms of the 
relative permittivity or the electric susceptibility. 

^ = (Cr — l)eo(^ = CoXe^ (2-11) 

Scientists studying optics often use index of refraction, another unitless 
measure which represents the ratio of the speed of fight in free space to the 
speed of light in the material. 

c speed of fight in free space , , 

n = V = - (2.12) 

1^1 speed of fight in material 

Since electromagnetic waves cannot travel faster than the speed of light 
in free space, index of refraction of a material is greater than one, n > 1. 
Assuming a material is a good insulator and p, = Ho, which are typically 
safe assumption for optics, the relationship between index of refraction and 
permittivity simplifies to 


n = v^- (2-13) 

Table 2.1 lists relative permittivities of some insulators used to make 
capacitors or piezoelectric devices. The values are all approximates. See 
the references cited for more detailed information. 

In the definitions of Section 1.6.3 and in Table 2.1, permittivity is 
treated as a scalar constant, but in some contexts a more complicated 
description is needed. In a crystalline material, a voltage applied along one 
crystallographic axis may induce charge separation throughout the material 
more easily than a voltage of the same size applied along a different axis. 
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Material 

Relative 
permittivity e,. 

Reference 

Vacuum 

1.0 

[3, p. 20 ] 

Teflon 

2.1 

[3, p. 20 ] 

Polyethylene 

2.3 

[3, p. 20 ] 

Paper 

3.0 

[3, p. 20 ] 

Si 02 

3.5 

[18] 

Mica 

6.0 

[3, p. 20 ] 

AI 2 O 3 

9 

[18] 

AlP 

10.2 

[9] 

ZrSi 04 

12.5 

[19] 

Si 

11.8 

[9] 

Ge 

16 

[9] 

Ta 205 

24 

| 20 | 

Zr 02 

25 

[18] 

Hf 02 

40 

[18] 

Ti 02 

50 

[18] 

PbS 

161 

[9] 

PbSe 

280 

[9] 

BaSrTiOa 

300 

[18] 

PbTe 

360 

[9] 


Table 2 . 1 : Approximate values of relative permittivity of various materials. 
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No applied voltage With applied voltage 


Figure 2.1: Illustration of material polarization. 


In such cases, the material is called anisotropic. Permittivity of anisotropic 
materials is more accurately described by a matrix. 

( ^xx ^xy 
€yx ^yy 
^zx ^zy 

The left part of Fig. 2.1 shows some atoms of a crystal. The small black 
circles represent the location of the nuclei of atoms in the crystals, and 
the gray circles represent the electron cloud surrounding the nuclei of each 
atom. If an electric field is applied in the dz direction, the material po¬ 
larizes, so the electrons are slightly displaced with respect to the nuclei as 
shown in the figure on the right. Since the spacing of atoms is different in 
the dx and dy direction than the dz direction, the external field required to 
get the same charge displacement will be different in the dx and dy direc¬ 
tions than the dz direction for this material. For this reason, the material 
illustrated in the figure is anisotropic, and the permittivity is best described 
by a matrix as opposed to a scalar quantity. 

2,2,4 Capacitor Properties 

Capacitors are energy conversion devices used in applications from stabiliz¬ 
ing power supplies, to filtering communication signals, to separating out a 
DC offset from an AC signal. Though capacitors and batteries both store 
electrical energy, energy in batteries is stored in the chemical bonds of 
atoms of the electrodes while energy is stored in capacitors in the material 
polarization from bound charges shifting in a dielectric layer. 

The first two measures to consider when selecting a capacitor to use in 
a circuit are the capacitance and the maximum voltage. A capacitor can 
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Capacitor Types 



-,...r...............■.. 

lO"'' 10® 1o' 10^ 10® 10“* 10® 

Maximum Voltage Rating in Volts 

Eigure 2.2: Range of capacitance and maximum voltage values for various 
capacitor types, following [21] and [22], 

be damaged if it is placed in a circuit where the voltage across it exceeds 
the maximum rated value. Approximate ranges for these parameters for 
capacitors with different dielectric materials are shown in Eig. 2.2. Capac¬ 
itance ranges are on the vertical axis, and maximum voltage ranges are on 
the horizontal axis. Eor example, electrolytic capacitors often can be found 
with capacitance values ranging from 10“^ to 1 E and maximum voltage 
ratings in the range of 1 to 1000 V. Similarly, ceramic capacitors can of¬ 
ten be found with capacitance values ranging from 10“^^ to 5 ■ 10“"^ E and 
maximum voltage ratings in the range of 1 to 50,000 V. 

While capacitance and maximum voltage rating are important param¬ 
eters to consider, they are not the only considerations. Another factor to 
consider is temperature stability. Ideally, the capacitance will be indepen¬ 
dent of temperature. However, all materials have a nonzero temperature 
coefficient. Ceramic and electrolytic capacitors tend to be more sensitive to 
temperature variation than polymer or vacuum capacitors [22]. Accuracy, 
or precision, is also important. Just as resistors are labeled with tolerances, 
capacitors may have tolerances of, for example, ±5% or ±10%. Another 
factor to consider is equivalent series resistance [23, ch. 1]. All materi¬ 
als have some resistivity, so all capacitors have some finite resistance. To 
account for the internal resistance, we can model any physical capacitor 
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2.2 Capacitors 



Figure 2.3: Natural mica. 


as an ideal capacitor in series with an ideal resistor, and the value of the 
resistor used is called the equivalent series resistance. Also, leakage of a 
capacitor should be considered [22]. If a capacitor is able to retain its 
stored charge for a long period of time, the capacitor has small leakage. 
If the capacitor discharges quickly even when disconnected from a circuit, 
it has large leakage. An ideal capacitor has no leakage [22]. Capacitors 
are also differentiated by their lifetime. An ideal capacitor operates for 
decades without degradation. However, some types of capacitors, such as 
electrolytic capacitors, are not designed to have long lifetimes [22]. Other 
factors to consider include cost, availability, size, and frequency response 
| 22 |. 

Ceramics, glasses, polymers, and other materials are used as the di¬ 
electric [22]. Often capacitors are classified by the dielectric material they 
contain [22]. Ceramic capacitors are small, cheap, and readily available 
[22]. They can often tolerate large applied voltages [22]. They typically 
have small capacitance values, poor accuracy, poor temperature stability 
and moderate leakage [22]. They have low equivalent series resistance and 
can withstand a lot of current, but they can cause transient voltage spikes, 
[23, ch. 1]. Some ceramic capacitors are piezoelectric. If these capacitors 
are vibrated, or even tapped with a pencil, noise will be introduced in the 
circuit due to piezoelectricity [23, ch. 12]. 

Mica is an interesting material which is used as a dielectric in capacitors. 
Figure 2.3 shows naturally occurring mica collected at Ruggles Mine near 
Grafton, New Hampshire. Mica comes in different natural forms including 
biotite and muscovite KAl 2 (AlSi 30 io)(OH )2 [24]. Mica is a flaky mineral 
with a layered structure [24], so mica capacitors can be made with very 
thin dielectric layers. Mica capacitors often have good accuracy and small 
leakage [22]. 

Capacitor dielectrics have been made from many types of polymers in- 
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Eigure 2.4: Through-hole size capacitors. 


eluding polystyrene, polycarbonate, polyester, polypropylene. Teflon, and 
mylar [22]. These capacitors often have good accuracy, temperature stabil¬ 
ity, and leakage characteristics [22]. 

Not all capacitors have solid dielectrics. A vacuum is a dielectric. Ca¬ 
pacitors with a vacuum dielectric are used in applications which involve 
high voltage or which require very low leakage [22]. Capacitors with liq¬ 
uid dielectrics made of oil are used in similar situations [22]. Electrolytic 
capacitors often have dielectrics which are a combination of solid materi¬ 
als with liquid electrolytes. An electrolyte is a liquid through which some 
charges can flow more easily than others. Electrolytic capacitors are polar¬ 
ized, meaning that they have positive and negative terminals, so, similar to 
a diode, the orientation of the capacitor in a circuit is important. Inside an 
electrolytic capacitor is a junction of multiple materials. The initial appli¬ 
cation of voltage in the factory chemically creates an oxide layer which is 
the dielectric. Reversing the voltage will dissolve the dielectric and destroy 
the capacitor. One advantage of electrolytic capacitors is that a small de¬ 
vice can have a large capacitance. However, they often have poor accuracy, 
temperature stability, and leakage [22]. Also, electrolytic capacitors have a 
finite lifetime because the liquid can degrade over time. 

2.3 Piezoelectric Devices 

Can we induce a material polarization in an insulator in a way that does 
not involve applying a voltage? If so, then this method can charge a ca¬ 
pacitor, and we can discharge the capacitor as usual to produce electricity. 
Any device that accomplishes this task is an energy conversion device. 
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Piezoelectric, pyroelectric, and electro-optic devices all involve this type 
of energy conversion, and they are all currently available as sensors and 
as other products. In piezoelectric devices, discussed in this section, a 
mechanical stress causes a material polarization. 

If a large enough strain is exerted on a material, the crystal structure 
will change. For example, at high enough temperature and pressure, coal 
will crystallize into diamond, and when the pressure is removed, the mate¬ 
rial stays in diamond form. Steel can be hardened by repeatedly hitting it 
in a process called shot peening. A significant amount of energy is needed 
to permanently change the crystal structure of a material. In this section, 
we are not discussing this effect. Instead, we are discussing an effect that 
typically requires little energy. When a mechanical strain is exerted on 
a piezoelectric device, a material polarization is established. The valence 
electrons are displaced, but the nuclei of the material and other electrons 
do not move. When we release the stress, the material polarization goes 
away. 

2.3.1 Piezoelectric Strain Constant 

We can describe the material polarization of a piezoelectric insulating ma¬ 
terial by incorporating a term which depends on the applied mechanical 
stress, [25]. 

^ = + (2.14) 

In this equation, is material polarization in ^ is displacement flux 
density in eo is the permittivity of free space in is the applied 

electric field intensity in d is the piezoelectric strain constant in y, and 
is the stress in pascals. Stress can also be given in other units. 

lPa=lj^ = l^ (2.15) 

111 

For many materials, the piezoelectric strain constant d is zero, and for 
many other materials, d is quite small. Barium titanate is used to make 
piezoelectric sensors because it has a relatively large piezoelectric strain 
coefficient, d ^ 3 ■ 10“^° Y [25, p. 408]. Additional example coefficients 
are given in the next chapter in Table 3.1. 

Mechanical strain is a unitless measure of deflection or deformation 
while stress has units pascals. Without an external electric field, these 
quantities are related by Young’s elastic modulus which has units 

strain = (—-^-—— ) - stress (2.16) 

\ Young s elastic modulus / 



2 CAPACITORS AND PIEZOELECTRIC DEVICES 


33 


If an electric field is also applied, stress and strain are related by 

strain = f —-^-—— ) ■ stress + ^ ■ d (2.17) 

\ Young s elastic modulus / 

where d is the piezoelectric strain constant. 

The energy stored in a piezoelectric device under stress is given by 

E=\^\-A-l-ri^ff (2.18) 

where A is the cross sectional area of a device in m^, / is the deformation in 
m, and ? 7 e// is the efficiency. Devices which are bigger, are deformed more, 
or are made from materials with larger piezoelectric constants store more 
energy. 

According to Eq. 2.14, the material polarization of an insulating crys¬ 
tal is linearly proportional to the applied stress. While this accurately 
describes many materials, it is a poor description of other materials. Eor 
other piezoelectric crystals, the material polarization is proportional to the 
square of the applied stress 

— eo + d\~^\ + dquad\~^\ (2.19) 

where dquad is another piezoelectric strain constant. To model the material 
polarization in other materials, terms involving higher powers of the stress 
are needed. 

2,3.2 Piezoelectricity in Crystalline Materials 

To understand which materials are piezoelectric, we need to introduce some 
terminology for describing crystals. Crystalline materials may be composed 
of elements, such as Si, or compounds, such as NaCl. By definition, atoms 
in crystals are arranged periodically. Two components are specified to 
describe the arrangement of atoms in a crystal: a lattice and a basis [25, 
p. 4]. A lattice is a periodic array of points in space. An n-dimensional 
lattice is specified by n lattice vectors for integer n. We can get from one 
lattice point to every other lattice point by traveling an integer number of 
lattice vectors. Three vectors, at, and are used to describe physical 
lattices in three-space. The choice of lattice vectors is not unique. Lattice 
vectors which are as short as possible are called primitive lattice vectors. A 
cell of a lattice is the area (2D) or volume (3D) formed by lattice vectors. 
A primitive cell is the area or volume formed by primitive lattice vectors, 
and it is the smallest possible repeating unit which describes a lattice. 
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Lattice 


Basis Crystal structure 



• • • 

© © © 


• • • 

© © © 

Figure 2.5: Two dimensional illustration of the terms lattice, basis, crystal 
structure, and primitive lattice vector. 

To specify the structure of a material, we attach one or more atoms to 
every point in the lattice. This arrangement of atoms is called a crystal 
basis. The lattice and crystal basis together define the crystal structure 
[25]. Figure 2.5 shows a two dimensional example of a lattice, crystal 
basis, and crystal structure. Since this example is two dimensional, only 
two lattice vectors are needed to specify the lattice. Two primitive lattice 
vectors are shown, and a primitive cell is shaded. 

There are 14 possible three dimensional lattice types, and these are 
called Bravais lattices [25]. Each of these possible lattices has a descriptive 
name. Figure 2.6 shows four of the possible Bravais lattices: simple cubic, 
body centered cubic, face centered cubic, and asymmetric triclinic. In the 
simple cubic lattice, all angles between fine segments connecting nearest 
neighbor points are right angles, and all lengths between nearest neighbor 
points are equal. In the asymmetric triclinic lattice, none of these angles 
are right angles, and none of these lengths between nearest neighbor points 
are equal. Figure 2.6 shows lattice cells, but the cells for the body centered 
cubic and face centered cubic lattices are not primitive cells because smaller 
repeating units can be found. 

Consider some example lattices and crystal structures. The crystal 
structure of sodium chloride, for example, involves a face centered cubic 
lattice and a basis composed of one sodium and one chlorine atom. An¬ 
other example is silicon which crystallizes in what is known as the diamond 
structure [25]. This crystal structure involves a face centered cubic lattice 
and a basis composed of two silicon atoms, at location (0, 0, 0) and 
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Simple cubic: 


Body centered cubic: 



Face centered cubic: Asymmetric triclinic: 



Eigure 2.6: Illustration of some Bravais lattices. 


where I is the length of the primitive cell. Carbon, Si, Ge, and Sn all crys¬ 
tallize in this diamond structure with cell lengths of / = 0.356, 0.543, 0.565, 
and 0.646 nm respectively [25]. 

While there are only 14 possible three dimensional lattices, there are 
significantly more possible crystal structures because the crystal structure 
also incorporates the basis. R is not possible to list all possible crystal 
structures. Instead, they are classified based on the symmetries they con¬ 
tain. Possible symmetry operations are 2-fold, 3-fold, 4-fold, and 6-fold 
rotations, horizontal and vertical mirror planes, and inversion. Crystal 
structures are grouped based on the symmetry elements they contain into 
classes called crystal point groups. There are 32 possible crystal point 
groups, and they are listed in the Table 2.2. 

Some authors classify crystal structures into crystal space groups instead 
of crystal point groups [6] [26]. While there are 32 crystal point groups, 
there are 230 crystal space groups. Crystal space groups are based on 
symmetry transformations which can incorporate not only rotations and 
mirror planes but also combination of translations along with rotations 
and mirror planes. Crystal space groups will not be discussed further in 
this text. 
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Table 2.2: Summary of crystal point groups. 
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Table 2.2 continued: Summary of crystal point groups. 
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2.3 Piezoelectric Devices 



Figure 2.7: Shapes used to illustrate symmetry elements. 



As an example of identifying symmetry elements, consider the 2D shapes 
in Fig. 2.7. The T-shaped figure has one symmetry element, a mirror plane 
symmetry. The shape looks the same if it is refiected over the mirror plane 
shown in the figure by a dotted fine. The Q-shape has no symmetry ele¬ 
ments. The hexagon has multiple symmetry elements. It contains 2-fold 
rotation because it looks the same when rotated by 180°. It also has 3-fold 
and 6-fold rotation symmetries because it looks the same when rotated by 
60° and 30° respectively. It also has multiple mirror planes shown by dot¬ 
ted lines in the figure. In this example, symmetry elements of 2D shapes 
are identified, but material scientists are interested in identifying symme¬ 
tries of 3D crystal structures to gain insights in the properties of materials. 
Materials are classified into categories called crystal point groups based on 
the symmetries of their crystal structures. 

We generalize about crystalline materials based on whether or not their 
crystal structure possesses inversion symmetry. What is the inversion op¬ 
eration? In 2D, inversion is the same as a rotation by 180°. In 3D, a 
shape or crystal structure contains inversion symmetry if it is identical 
when rotated by 180° and inverted through the origin [24, p. 269]. More 
specifically, draw a vector from the center of the shape to any point 
on the surface. If the shape has inversion symmetry, then for any such 
vector I^, the point a distance from the origin is also on the surface 
of the shape. The example on the left of Fig. 2.8 has inversion symmetry 
because for any such vector ^ from the center of the sh^e to a point on 
the surface, there is a point on the surface a vector — V away from the 
origin too. The example on the right does not contain inversion symmetry 
as illustrated by the vector shown by the arrow. 

If a crystal structure has inversion symmetry, we say the crystal has 
a eenter of symmetry otherwise we say it is noneentrosymmetrie. Crystal 
structures are classified into classes called crystal point groups, and twenty- 
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With Inversion Symmetry 


Without Inversion Symmetry 



Eigure 2.8: The shape on the left contains inversion symmetry while the 
shape on the right does not. 


one of the 32 point groups have no center of symmetry thus do not contain 
inversion symmetry [24, p. 35]. Twenty of these crystal point groups have 
a polar axis, some axis in the crystal with different forms on opposite ends 
of the axis. These twenty one crystal point groups are specified as noncen- 
trosymmetric in the sixth column of Table 2.2. If we mechanically stress 
these materials along the polar axis, different amounts of charges will build 
up on the different sides of the axis. Dielectric crystalline materials whose 
crystal structure belongs to any one of these 21 of these noncentrosymmet- 
ric crystal point groups are piezoelectric [24]. 

Table 2.2 lists all crystal point groups and summarizes whether crys¬ 
talline materials whose crystal structure belongs to each group can be 
piezoelectric, pyroelectric, and Pockels electro-optic. Pyroelectricity and 
electro-optics are discussed in the next chapter. Information in the table 
comes from references [24] [26] [27] [28]. The left two column list the 32 
possible crystal point groups. There are two different, but equivalent, ways 
of labeling the crystal point groups. The first column names the crystal 
point groups using Hermann-Mauguin notation. This notation dates to the 
1930s and is used by chemists, mineralogists, and some physicists. The 
second column names the crystal point groups using Schoenfiies notation. 
Schoenfiies notation dates from 1891 [29], and it is used by mathematicians, 
spectroscopists, and other physicists. 

The third column of Table 2.2 lists the crystal system. As shown in Pig. 
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Figure 2.9: Labels on a primitive cell of a lattice. 


2.9, the angles of a primitive cell of a lattice are labeled a, /3, and 7 , and 
the lengths of the sides are labeled a, b, and c. Crystal point groups can 
be classified based on the angles and lengths of the primitive cell of the 
lattice which belongs to that group. The literature contains multiple sub¬ 
tly different ways of defining crystal systems [30]. The information in the 
third column follows reference [28]. The fourth column gives relationships 
between the angles of the primitive cell. The fifth column gives relation¬ 
ships between the lengths of sides of the primitive cell. Combinations of 
angles and lengths are not unique to a specific row. For example, classes 
C 2 and Cih both have a = 90°, 7 ^ 90°, 7 = 90°, and a ^ b ^ c. However, 

crystal structures belonging to these crystal point groups contain different 
symmetry elements. For more details on specifically which symmetry el¬ 
ements are contained in which crystal point group, see [24] [26] [27] [28]. 
The sixth column fists whether or not the crystal point group has inver¬ 
sion symmetry. Crystal structures with no inversion symmetry or center of 
symmetry, called noncentrosymmetric, are both piezoelectric and Pockels 
electro-optic. The last column fists whether or not crystalline materials 
whose crystal structure belongs to the various crystal point groups can be 
pyroelectric. 

It is possible to start with a crystal structure of a material, derive 
the symmetry elements it contains, derive whether or not the material is 
piezoelectric, and derive whether or not the material is pyroelectric. Fur¬ 
thermore, it is possible to derive along which axes piezoelectricity or py¬ 
roelectricity can occur in the material. However, this derivation is beyond 
the scope of this book. For further details, see [ 6 ] [27] [28] [31]. 

To predict whether or not a dielectric crystalline material is piezoelec¬ 
tric, identify its lattice and crystal basis to identify its crystal structure. 
Identify the symmetries of the crystal structure to classify its crystal struc¬ 
ture into a particular crystal point group. If that crystal structure contains 
inversion symmetry, the material can be piezoelectric. We often do not 
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have to go through all of these steps because the crystal point group for 
many crystalline materials is tabulated [32], Even if the crystal structure 
for a material contains inversion symmetry, the piezoelectric effect and the 
piezoelectric strain coefficient d may be too small to measure. 

The effect may only occur when you stress the material along some 
particular axis, and it may not occur for a mechanical stress of an arbi¬ 
trary orientation with respect to the direction of the crystal axes. There is 
only one crystal point group, called asymmetric triclinic, where a random 
stress will produce a material polarization [24]. Eor all other point groups, 
only stresses along certain axes will produce a material polarization [24]. 
Eurthermore, in most crystals a given amount of stress along one axis of 
the crystal will produce a different amount of material polarization than 
the same amount of stress applied along a different crystallographic axis. 
Qualitatively, compressing a crystal along one axis may cause more charge 
displacement than compressing a crystal along a different axis. Eor this 
reason, it is more accurate to treat the piezoelectric strain coefficient as a 
matrix. This 3x6 matrix has elements 

(9strain along k 
(9electric field along i 

where electric field has x, y, and 2 : components, and the stress can be 
applied along the xx, xy, xz, yy, yz, or directions. 

2,3.3 Piezoelectricity in Amorphous and Polycrystalline Mate¬ 
rials and Ferroelectricity 

The previous section discussed piezoelectricity in crystals. We can discuss 
symmetries of the crystal structure of crystalline materials, but we can¬ 
not even define a crystal structure for amorphous materials. However, it is 
possible to make piezoelectric devices out of polycrystalline and amorphous 
materials. In a dielectrics, if we apply an external electric field, a material 
polarization is induced. Electric dipoles form because the electrons and 
nuclei of the atoms displace slightly from each other. Coulomb’s law tells 
us that charge buildups, such as these electric dipoles, induce an electric 
field. So, if we apply an external electric field to a dielectric, this primary 
effect induces a material polarization, and this material polarization will, 
as a secondary effect, induce additional material polarization in the ma¬ 
terial. Once one atom polarizes forming an electric dipole, nearby atoms 
will polarize. Small regions of the same material polarization are called 
electrical domains. 



for a given stress 


( 2 . 20 ) 
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In certain dielectric materials, an external mechanical stress induces a 
local material polarization. The charge buildup of this material polarization 
induces a material polarization in nearby atoms forming electrical domains 
[23]. This piezoelectric effect can occur whether the original material is 
crystalline, amorphous, or polycrystalline [23]. In noncrystalline materials, 
this effect is necessarily nonlinear, so these materials are not well described 
by Eqs. 2.14 or 2.19. 

The nonlinear process of a material polarization of one atom inducing 
a material polarization of nearby atoms causing the formation of electrical 
domains is called ferroelectricity. Ferroelectric materials may be crys¬ 
talline, amorphous, or polycrystalline. We will see in the next chapter that 
materials can be ferroelectric pyroelectric and ferroelectric electro-optic in 
addition to ferroelectric piezoelectric. The ferroelectric effect is limited by 
temperature. For many ferroelectric materials, these effects occur only be¬ 
low some temperature, called the Curie temperature. When the materials 
are heated above the Curie temperature, the ferroelectric effect goes away 
[33]. The material polarization of a ferroelectric material may depend on 
whether or not a material polarization has previously been induced. If the 
state of a material depends on its past history, we say that the material has 
hysteresis. Ferroelectric materials may have a material polarization even 
in the absence of an external mechanical stress or electric field if a source 
of energy has previously been applied. 

While the prefix ferro- means iron, most ferroelectric materials do not 
contain iron, and most iron containing materials are not ferroelectric. The 
word ferroeleetrie is used as an analogy to the word ferromagnetie. Some 
iron containing materials are ferromagnetic. If an external magnetic field is 
applied across a ferromagnetie material, an internal magnetic field is set up 
in the material. Ferromagnetic materials can have a permanent magnetic 
dipole even in the absence of an applied magnetic field. We can model an 
electric dipole as a pair of charges. We can model a magnetic dipole as a 
small current loop. Ferromagnetic materials exhibit hysteresis, and they 
have magnetic domains where the magnetic dipoles are aligned. 

Originally, a piezoelectric ferroelectric material has randomly aligned 
electrical domains and no net material polarization, so it starts out as 
neither piezoelectric or ferroelectric. The process of causing a material 
to exhibit piezoelectricity and ferroelectricity is called poling. To pole 
a material, place it in a strong external electric field [23], for example, 
across the poles of a battery, hence the term. Poling does not change 
the atomic structure, so if the material was originally amorphous, it will 
remain amorphous. During this process, electrical domains form, and these 
domains remain even when the external field is removed. A material that is 
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piezoelectric due to this type of poling is sometimes called an electret [15, 
p. 297]. After the material is poled, it may have a net material polarization 
throughout. Eurthermore after poling, it is piezoelectric and ferroelectric, 
so an external mechanical stress induces a material polarization locally and 
throughout the material. 

2,3.4 Materials Used to Make Piezoelectric Devices 

What makes a good material for a piezoelectric sensor or piezoelectric en¬ 
ergy conversion device? Eirst piezoelectric devices are made from electrical 
insulators. When an external voltage is applied across a conductor, va¬ 
lence electrons are removed from their atoms, so no material polarization 
accumulates. Second, piezoelectric devices are made from materials with 
large piezoelectric strain constants. The piezoelectric strain constant is so 
small that it cannot be detected in many crystals with crystal structures 
from one of the 21 crystal point groups known to be piezoelectric, and it 
is zero in crystals from the other crystal point groups. Third, piezoelectric 
devices should be made from materials that are not brittle so that they can 
withstand repeated stressing without permanent damage. Thermal prop¬ 
erties may also be important [33]. There is no material that is best in all 
applications. 

Quartz, crystalline Si02, was the first material in which piezoelectricity 
was studied. Pierre and Jacques Curie discovered the effect in quartz in the 
1880s [3]. Today, many piezoelectric devices, including crystal oscillators, 
are made from quartz. Lead zirconium titanate is another material used due 
to its relatively high piezoelectric strain constant [3] [34]. In applications 
which require flexibility and the ability to withstand repeated mechanical 
stress without damage, polymers such as polyvinyldenfiuoride are used [25]. 
Piezoelectricity has also been studied in materials including barium titanate 
BaTiOa, lithium niobate, tourmaline 

(Na,Ca)(Li,Mg,Al)3(Al,Pe,Mn)g(B03)3(Si608)(0H)4, 
and Rochelle salt 

KNaC4H406 ■ 4 H 2 O 

|3| |23| |24| |34|. 

Manufacturers of piezoelectric devices do not often label their products 
to say whether they are made from crystalline, amorphous, or polycrys¬ 
talline materials, but there are advantages and disadvantages to the dif¬ 
ferent types of materials. An advantage of making piezoelectric devices 
from polycrystalline or amorphous materials is that the devices can be 
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made more easily into different shapes such as cylinders and spheres [33]. 
However, the materials used often have lower melting temperatures, higher 
temperature expansion coefficients, and are more brittle [33]. Crystalline 
materials, such as quartz, have the advantages of being harder and having 
a higher melting temperature [33]. 

2.3.5 Applications of Piezoelectricity 

A number of electrical components involve piezoelectricity. When a voltage 
is applied across a piece of piezoelectric material, it mechanically bends and 
deforms. When the voltage is released, it springs back at a natural reso¬ 
nant frequency. This material can be integrated with a feedback circuit 
to produce oscillations at a precise frequency. Electrical oscillators of this 
type are often made from crystalline quartz. A more recent application 
is the piezoelectric transformer. These devices are used in the cold cath¬ 
ode fluorescent lamps which are used as backlight for LCD panels [23, p. 
289]. The lamps require around a thousand volts to turn on and hundreds 
of volts during use. Transformers made of magnets and coils can achieve 
these high voltages, but piezoelectric transformers are much smaller, small 
enough to be mounted on a printed circuit board. A traditional trans¬ 
former involves a pair of coils, and it converts AC electricity to magnetic 
energy to AC electricity at a different voltage. Similarly, a piezoelectric 
transformer also involves multiple energy conversion processes. In such a 
device, AC electricity is converted to mechanical vibrations and then to 
AC electricity at a different voltage. Energy is conserved in these devices, 
so they can produce high voltages with low currents. Eigure 2.10 shows 
a piezoelectric transformer that can convert an input of 8 to 14 V to an 
output up to 2 kV [35]. Eigure 2.11 shows an example of some small piezo¬ 
electric circuit components. Starting in the upper left and going clockwise, 
a microphone, ultrasonic transmitter and receiver, vibration sensor, and 
oscillator are shown. 

Efficiency of energy conversion devices is hard to discuss because every 
author makes different assumptions. However by any measure, efficiency of 
a commercial piezoelectric device is low, often 6% or less [36]. Due to this 
low efficiency, many piezoelectric devices are used as sensors. Regardless 
of this low efficiency, other devices are used for energy harvesting. Eor 
example, one train station embedded piezoelectric devices in the platforms 
to generate electricity. Piezoelectric devices also have been used to convert 
the energy from the motion of a fluid or from wind directly to electricity 
|36|. 

There is interest in using piezoelectric devices for biomedical applica- 
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Eigure 2.10: A piezoelectric transformer that takes an input of 8-14 V and 
produces and output of up to 2 kV. This picture is used with permission 
from [35]. 



Eigure 2.11: Example small piezoelectric devices. Clockwise from top left: 
electret microphone, ultrasonic distance sensor, vibration sensor, oscillator 
crystal. 
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tions. Quartz is piezoelectric, and it is durable, readily available, and 
nontoxic. Engineers have developed piezoelectric devices designed for use 
outside of the body and to be implanted inside the body. Some piezoelectric 
devices are used as sensors. For example, piezoelectric sensors can monitor 
knees or other joints [3]. Also, ultrasonic imaging is a common diagnostic 
technique. Piezoelectric devices are used both to generate the ultrasonic 
vibrations and to detect them [33]. Other biomedical piezoelectric devices 
are used as a source of electrical power. Artificial hearts, pacemakers, and 
other devices require electricity, and they are often limited by battery tech¬ 
nology available to supply the energy [36]. Piezoelectric generators have 
no moving parts to wear out, and they can avoid the problem of needing 
to change the batteries. Physical activity can be classified as continuous, 
such as breathing, or discontinuous, such as walking. Both types of physi¬ 
cal activity can be used as a source of mechanical energy for piezoelectric 
devices [36]. The amount of power required for different biomedical devices 
varies quite a bit. For example, an artificial heart may require around 8 W 
while a pacemaker may require only a few microwatts [36]. Piezoelectric 
devices may be able to capture energy from typical physical activity and 
convert it into electrical energy to power the device. A piezoelectric device 
in an artificial knee has produced 0.85 mW [36], and a device in a shoe has 
generated 8.4 mW from walking [36]. 

Piezoelectric devices are used in other types of imaging systems besides 
biomedical imaging systems. One of the earlier applications was in sonar 
systems. Around the time of WWI, the military actively developed sonar 
systems to detect boats and submarines. Today, sonar systems are used to 
detect fish and to measure the depth of bodies of water [33]. Sonar imaging 
is also used to analyze electrical circuits and to detect imperfections and 
cracks in steel and in welds [33]. 

Piezoelectric devices are used in a variety of other applications too. 
Piezoelectric sensors are used in some buttons and keyboards [36]. Piezo¬ 
electric devices are used to make accelerometers [37, p. 353], and they are 
used to measure pipe flow [33]. Speakers, microphones and buzzers can all 
be made from piezoelectric devices, and they can operate at both audio 
and ultrasonic frequencies. Piezoelectric devices that generate ultrasonic 
signals can be used to emulsify dyes, paints, and food products like peanut 
butter [33]. Also, they are used in some barbecue grill ignitions where 
mechanical stress induces an electric spark [23, ch. 15]. 
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2.1. A parallel plate capacitor has a capacitance of O = 10 pE. The plates 
have area 0.025 cm^. A dielectric layer of thickness dthick = 0.01 mm 
separates the plates. Eor the dielectric layer, calculate the permit¬ 
tivity e, the relative permittivity e^, and the electric susceptibility 

Xe- 

2.2. We often assume that the capacitance of a capacitor and the permit¬ 
tivity of a material are constants. However, sometimes these quan¬ 
tities are better described as functions of frequency. Consider a ca¬ 
pacitor made from parallel plates of area 0.025 cm^ separated by 0.01 
mm. Assume that for oj < 10® the capacitance is well modeled 
by 

C{u) = 8 ■ 10-1^ + 3 ■ 10-i®a; 

in farads. Eor the dielectric material between the plates of the capac¬ 
itor, calculate the permittivity e{u), the relative permittivity er{uj), 
and the electric susceptibility Xe{^)- 

2.3. A cylindrical sandwich cookie has a radius of 0.75 in. The cookie is 
made from two wafers, each of thickness 0.15 in, which are perfect 
dielectrics of relative permittivity = 2.8. Between the wafers is a 
layer of cream filling of thickness 0.1 in which is a perfect dielectric 
of relative permittivity = 2.2. Eind the overall capacitance of the 
cookie. 

Hint: Capacitances in series combine as . 

2.4. A parallel plate capacitor has a capacitance of 3 /iE. 

(a) Suppose another capacitor is made using the same dielectric 
material and with the same cross sectional area. However, the 
thickness of the dielectric between the plates of the capacitor is 
double that of the original capacitor. What is its capacitance? 

(b) Suppose a third capacitor is made with the same cross sectional 
area and thickness as the first capacitor, but from a material 
with twice the permittivity. What is its capacitance? 
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F 

2.5. A piezoelectric material has a permittivity of e = 3.54 ■ 10“ ^ and 

has a piezoelectric strain constant of d = 2 ■ 10“^° y. If the material 

is placed in an electric field of strength |^| = 70 ^ and is subjected 
to a stress of |"^| = 3.5 Calculate the material polarization. 

2.6. A piezoelectric material has permittivity = 2.5. If the material is 
placed in an electric field of strength |^| = 2-10^ ^ and is subjected 
to a stress of | ”7^1 = 200 the material polarization of the material 
is 3.2 • 10“^ Calculate d, the piezoelectric strain constant. 

2.7. Consider two piezoelectric devices of the same size and shape. The 
dielectric material of the first device has a permittivity of e = 2.21 ■ 
10“^^ m ^ piezoelectric strain constant of d = 8 ■ 10“^^ y. The 
dielectric material of the second device has an electric susceptibility 
of Xe =3.2 and a piezoelectric strain constant of d = 2 ■ 10“^° y . 

(a) Find e^, the relative permittivity, for each device. 

(b) Find the ratio of the capacitance of the first device to the 
capacitance of the second device. 

(c) The devices are placed in an external electric field of strength 
|i7| = 32 No stress is placed on the devices. Calculate the 
material polarization, for each device. 

(d) The devices are placed in an external electric field of strength 
|i7| = 32 and a stress of |T^| = 100 ^ is applied to the 
devices. Calculate the material polarization, for each device. 

(e) Which device would you expect is able to store more energy? 
Explain your answer. 

2.8. A particular piezoelectric device has a cross sectional area of 10“® m^. 
When a stress of 800 ^ is applied, the device compresses by 10 /im. 
Under these conditions, the device can generate 2.4-10“® J. Calculate 
the efficiency of the device. 

2.9. A particular piezoelectric device has a cross sectional area of 10“^ m^ 
and an efficiency of 5%. When a stress of 1640 ^ is applied to the 
device, it oscillates with an average velocity of 0.01 Calculate the 
power that can be generated from the device. 
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2.10. A piezoelectric device is placed in an electric field of strength |^| 


In the first test, a stress of 


500 The device is tested twice. 

|T^| = 1000 ^ was put on the device, and the material polarization 
7^I = 2.75 ■ 10“® In the second test also with 




was measured to be 
1^1 = 500 a stress of | = 100 ^ was put on the device, and the 

material polarization was measured to be |i^| = 6.50 ■ 10 ® Eind 
the piezoelectric strain constant d, and find the relative permittivity 
of the material e^. 


2.11. According to the data sheet, a piezoelectric device is 3% efficient. A 
coworker says that energy is not conserved in the device because 97% 
of the energy is lost when it is used. Explain what is wrong with your 
coworker’s explanation. 

2.12. Match the material property with its definition. (Not all definitions 
will be used.) 


1 . A mechanical stress will cause a 
(material) polarization in this type of 
material. 

2. This type of material is glassy and 
noncrystalline. 

3. Charges do not easily how through 
this type of material 

4. In the presence of a weak external 
voltage, charges do not how in this type 
of material. In the presence of a strong 
external voltage, charges how easily. 

5. A material polarization in one atom 
induces material polarization in nearby 
atoms in this type of material. 


A. Amorphous 


B. Dielectric 


C. Eerroelectric 


D. Piezoelectric 





50 


2.4 Problems 


2.13. Consider a piezoelectric material in an external electric field E in 
units The figure shows the magnitude of the material polariza¬ 
tion, in units as a function of the strength of the external 

electric field when no mechanical stress is applied. The material has 
a piezoelectric strain constant of d = 5 ■ 10“^° 


(a) Find the relative permittivity e^, and find the electric suscepti¬ 
bility Xe- 

(b) Find and plot an expression for the magnitude of the material 
polarization as a function of the external electric field strength 
when a stress of 1000 ^ is applied. Label the axes of your plot 
well. 

(c) This material is used to make a piezoelectric device with a cross 
sectional area of 1 cm^. When this device is compressed a dis¬ 
tance of 1 mm, an energy of 2 ■ 10“^° J is stored. Find the 
efficiency of the device. 
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2.14. Consider the 2D crystal structure, shown in the figure, composed of 
a lattice and a crystal basis. The crystal basis is composed of two 
atoms of type A and one atom of type B. 

(a) Sketch the crystal basis. 

(b) Sketch the 2D lattice. 

(c) Draw two primitive vectors Oi and 02 on your sketch. 


@0 00 00 00 

0 0 0 0 

00 00 00 00 

0 0 0 0 

00 00 00 00 

0 0 0 0 


2.15. Consider the illustrations of the crystal structure of two 2D materials 
where X and O represent the location of different types of atoms. Do 
the materials have the same crystal structure? basis? lattice? crystal 
point group? Answer yes or no, and explain. 

Material 1: Material 2: 
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2.16. The figure below illustrates two possible crystal lattices: a face cen¬ 
tered cubic lattice and a body centered cubic lattice. The solid arrows 
represent lattice vectors, but not primitive lattice vectors, and the 
cells shown are not primitive cells. The dotted vectors in the figure 
show primitive lattice vectors. In the case of the face centered cubic 
lattice, the primitive lattice vectors go from a corner point to a point 
on in the middle of one of the faces of the cube. In the body centered 
cubic lattice, the primitive lattice vectors go from a corner point to 
a point in the center of a cell bordering that corner. Suppose that 
the solid vectors have length 0.4 nm. Find the length of the primitive 
lattice vector in the face centered cubic lattice, and find the length 
of the primitive lattice vector in the body centered cubic lattice. 

Face Centered Cubic Body Centered Cubic 
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3 Pyroelectrics and Electro-Optics 

3.1 Introduction 

Electrical engineers interested in materials often focus their study on semi¬ 
conductors or occasionally conductors. However, energy conversion devices 
are made out of all types of materials. In the last chapter we discussed ca¬ 
pacitors and piezoelectric devices. Both are constructed from a layer of 
insulating material between conductors. The properties of this dielectric 
layer determine the properties of the resulting devices. This chapter dis¬ 
cusses two additional types of devices that involve material polarization 
of insulators, pyroelectric devices and electro-optic devices. As with other 
types of energy conversion devices, these can operate two ways. Pyroelec¬ 
tric devices can convert a temperature difference to a material polarization 
and therefore electricity, or they can convert a material polarization to a 
temperature difference. Electro-optic devices can convert optical electro¬ 
magnetic radiation to a material polarization or vice versa. As with the 
devices studied in the last chapter, these devices are constructed around 
a dielectric layer, and the choice of material in the dielectric layer deter¬ 
mines the behavior of the device. Studying these devices is worthwhile 
even though they are encountered significantly less often than capacitors 
and piezoelectric devices because this study illustrates the variety of energy 
conversion devices that engineers have produced. 

If a solid is heated enough, it melts. Some materials have multiple 
crystal structures that are stable at room temperature. These materials 
may be converted from one crystal structure to another by heating and 
cooling. Similar effects can occur if energy is supplied by shining a strong 
enough laser on the material instead of heating it. This chapter is not 
concerned with effects involving melting or thermally changing the crystal 
structure from one phase to another. Instead, we consider the case when a 
small amount of energy is supplied, by heat or by electromagnetic radiation. 
The energies involved are enough to change the material polarization and 
the internal momentum of electrons but not the location of the nuclei of 
the material, for example. 

3.2 Pyroelectricity 

3.2.1 Pyroelectricity in Crystalline Materials 

Pyroelectric devices are energy conversion devices which convert a temper¬ 
ature difference to or from electricity through changes in material polariza¬ 
tion. The pyroelectric effect was first studied by Hayashi in 1912 and by 
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Material 

Chemical 

composi¬ 

tion 

Piezoelectric 
strain const, 
d in ™ from 

|38| |39| 

Pyroelectric 
coeff. 6 in 

m?K 

|38| |39| 

Pockels 
electro-optic 
coeff. 7 in Y 
from [27] 

Sphalerite 

ZnS 

1.60 ■ 10-1^ 

4.34 ■ 10-^ 

o 

1 

to 

Quartz 

Si02 

2.3 ■ 10“^^ 

1.67- 10-® 

0.23 ■ 10“^^ 

Barium 

Titanate 

BaTiOg 

o 

1 

o 

12 ■ 10-® 

19 ■ 10-^2 


Table 3.1: Example piezoelectric strain constants, pyroelectric coefficients, 
and Pockels electro-optic coefficients. Values for sphalerite assume the 43m 
crystal structure. Pockels coefficients assume a wavelength of A = 633 
nm. Average values specified in the references are given. See the cited 
references for additional assumptions. The Pockels electro-optic coefficient 
7 is defined in Sec. 3.3.1. 

Rontgen in 1914 [3] [40]. This effect occurs in insulators, so it is different 
from the thermoelectric effect. The thermoelectric effect, to be discussed in 
Chapter 8, is a process that converts between energy of a temperature dif¬ 
ference and electricity and occurs because heat and charges flow at different 
rates through junctions. 

If an insulating crystal is placed in an external electric field, the material 
will polarize. The electrons will displace slightly forming electric dipoles, 
and energy can be stored in this material polarization. In some pyroelectric 
materials, heating or cooling will also cause the material to polarize. We 
can model the material polarization by adding a term to Eq. 2.14 to account 
for the temperature dependence [3, p. 327]. 

^ = ^-eo^ + ’?AT. (3.1) 

As in Eq. 2.14 , 7^ represents material polarization in represents 

displacement flux density in represents electric field intensity in 

F 

and eo is the permittivity of free space in jjj-. The pyroelectric coefficient 
b has units and AT represents the change in temperature. The 

coefficient 6 is a vector because the material polarization may be different 
along different crystal directions. Table 3.1 lists example values for the 
pyroelectric coefficient as well as for other coefficients. (Note that this 
definition of b is similar but not identical to the definition in [3].) In some 
materials, the material polarization depends linearly on the temperature 
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as described by Eq. 3.1. In other materials, more terms are needed to 
describe the dependence of the material polarization on temperature. 

7^ = - eo^ + tAT + ^ (AT)' ... (3.2) 

Many materials exhibit pyroelectricity only below a temperature known as 
the pyroelectric Curie temperature. 

In the last chapter, we saw that we could determine whether or not a 
crystalline material was piezoelectric from its crystal structure. To do so, 
we identified the symmetries of the crystal structure. Crystal structures are 
grouped into 32 classes called crystal point groups based on the symmetries 
they contain. Crystal structures in the 21 of the crystal point groups that 
do not have a center of symmetry can be piezoelectric. We can use a simi¬ 
lar technique to determine if a crystalline material is or is not pyroelectric. 
All pyroelectric crystals are piezoelectric, but not all piezoelectric crystals 
are pyroelectric. To determine if a crystalline material can be pyroelec¬ 
tric, identify its crystal structure and determine the corresponding crystal 
point group. Crystals in the 10 crystal point groups listed in Table 2.2 are 
pyroelectric [3, p. 366] [26, p. 557]. 

3.2.2 Pyroelectricity in Amorphous and Polycrystalline Materi¬ 
als and Ferroelectricity 

In the Sec. 2.3.3 we saw that some materials, called ferroelectric piezo¬ 
electric materials, had a material polarization that depended nonlinearly 
on the mechanical stress applied. These materials could be crystalline, 
amorphous, or polycrystalline. When a charge separation occurred in one 
atom, the charges from that electric dipole induce dipoles to form in nearby 
atoms, and electrical domains with aligned material polarization form in 
the material. This effect depends on the mechanical stress applied to the 
material previously, and the dependence on past history is called hysteresis. 

Materials can also be ferroelectric pyroelectric, and these materials can 
be crystalline, amorphous, or polycrystalline. In these materials, the ma¬ 
terial polarization depends nonlinearly on the temperature, as opposed to 
the mechanical stress. As with the piezoelectric version of this effect, po¬ 
larization of one atom induces a material polarization in nearby atoms. 
Such materials can have a material polarization even when no temperature 
gradient is applied, and they can exhibit hysteresis. 

3.2.3 Materials and Applications of Pyroelectric Devices 

Pyroelectricity has been studied in a number of materials including barium 
titanate BaTiOs, lead titanate PbTiOa, and potassium hydrogen phosphate 
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KH 2 PO 4 [25] [26]. It has also been studied in chalcogenide glasses which are 
sulfides, selenides, and tellurides such as GeTe [25] [26]. When selecting a 
pyroelectric material for an application, the pyroelectric coefficient should 
be considered. Thermal properties are important too. The material should 
be able to withstand repeated heating and cooling, and it should have a 
relatively high melting temperature to be useful. 

The pyroelectric effect does not have many applications. Some optical 
detectors designed to detect infrared light are made from pyroelectric ma¬ 
terials [41] [42]. However, most optical detectors are photovoltaic devices 
made from semiconductor junctions, and this technology will be discussed 
in Chapter 6 . While sensors using the pyroelectric effect could be used to 
measure temperature, other types of temperature sensors, such as thermo¬ 
couples, are typically used. Thermocouples, which operate based on the 
thermoelectric effect which is discussed in Chapter 8 , are more convenient 
to build and operate. Additionally, in many pyroelectric materials, the ef¬ 
fect is nonlinear while linear sensors are easier to work with and calibrate. 

3.3 Electro-Optics 

3.3.1 Electro-Optic Coefficients 

Typically, the magnitude of material polarization in a dielectric is propor¬ 
tional to the strength of an applied electric field. 

^ ^ - eo^ = eoXe^ (3.3) 

In this equation Xe is the electric susceptibility, and it is unitless. It is 
defined in Sec. 2.2.3 and related to permittivity by Eq. 2.9. However in 
other materials, the material polarization depends nonlinearly on the ap¬ 
plied electric field. Materials for which the material polarization depends 
linearly on the external electric field are called linear materials while others 
are called nonlinear or electro-optic materials. The electro-optic effect oc¬ 
curs when an applied external electric field induces a material polarization 
in a material where the amount of polarization depends nonlinearly on the 
external field. The name involves the word optic because the external field 
is often due to a visible laser beam. However, the external field can be 
from any type of source at any frequency, and a material polarization will 
occur even with a constant applied electric field. A large enough external 
electric field will cause a material to melt or to crystallize in a different 
phase, but this effect is not the electro-optic effect. Instead, the electro¬ 
optic effect only involves a change in the material polarization, not the 
crystal structure, and the change involved is not permanent. 
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We can write the magnitude of the material polarization as a function 
of powers of the applied external field. 

'P =€oXe P P P + ■■■ (3.4) 

The quantity is called the chi-two coefficient, and it has units y. The 
quantity x^^'^ is called the chi-three coefficient, and it has units ^ [27] [42, 
ch. 1]. 

If an infinite number of terms are included on the right side of Eq. 3.4, 
any arbitrary material can be described. In most materials, only the first 
term of Eq. 3.4 is needed while x^‘^\ all higher order coefficients 

are negligible, and these materials are not electro-optic. Materials with 
^(2)^ ^(3) Qj. other coefficients nonzero are called electro-optic. It is rare to 
need more coefficients than Xe, X^‘^\ ^iid x^^^ to describe a material. 

The effect due to the P term is called the Pockets effect or 

linear electro-optic effect. It was first observed by Eriedrich Pockels in 
1893 [3, p. 382] [10]. In this case the material polarization depends on 

the square of the external field. The effect due to the P term is 

called the Kerr effect or the quadratic electro-optic effect. In this case, 
the material polarization depends on the cube of the external electric field. 
John Kerr first described this effect in 1875 [3, p. 382] [10]. 

While some authors use the coefficients Xe, and x^^\ this effect is 
most often studied by optics scientists who instead prefer index of refraction 
n, a unitless measure introduced in Sec. 2.2.3. In electro-optic materials, 
the index of refraction is a nonlinear function of the strength of the external 
electric field. Instead of expanding the material polarization in a power 
series as a function of the external field strength as in Eq. 3.4, the index of 
refraction is expanded. Pockels and Kerr coefficients are defined as terms 
of this expansion. 

(P 

As described by Eq. 2.3, material polarization is the difference in ^ 
between an external electric field in a material and the field in the absence 
of the material. 

Tl = T|-e„|f| (3.6) 

With some algebra, we can identify the displacement flux density compo¬ 
nent and the overall index of refraction. Add two terms which sum to zero 
to Eq. 3.4. 

1^1 = eoXe 1^1 + eol^l + +€oX^^^\P\ -eol^l (3.6) 
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The first two terms can 


be combined, and eo 



can be distributed out. 



iXe + 1) + 


+ X^^^ 

1 

2 

+ ... 

eo 

't 

— eo 

t 


(3.7) 


The first term is the displacement fiux density. 




— 6r 




(Xe + 1 ) + X^^^ 


+ X^^^ 


2 

+ ... 

eo 



(3.8) 


The quantity in brackets in Eq. 3.8 is the relative permittivity, eo- Since 
we are considering electro-optic materials, it depends nonlinearly on the 
applied external field. Assuming the material is a perfect dielectric with 
fj, = fiQ, the index of refraction is the square root of this quantity. It 
represents the ratio of the speed of fight in free space to the speed of light 
in this material, and it also depends nonlinearly on the applied external 
field. 

Ileo eo (^•^) 

The index of refraction must be larger than one because electromagnetic 
waves in materials cannot go faster than the speed of fight, so the quantity 
—must be less than one. 

€r eo 

Some authors expand the term in a Taylor expansion instead of the 
material polarization, and electro-optic coefficients are defined with respect 
to this expansion [42]. 



6r eo x 


(3.10) 


The coefficient 7 is called the Pockels coefficient and it has units The 
coefficient s is called the Kerr coefficient, and it has units In the 
absence of nonlinear electro-optic contributions, we can denote the relative 
permittivity as x and the index of refraction as n^, where 


Cr x = T^l = Xe + ^- (3.11) 

The expansion of Eq. 3.10 is guaranteed to converge because 
Example values of the Pockels electro-optic coefficient are listed in Table 

3.1. 

With some algebra, the overall index of refraction Ueo can be written in 
terms of the Pockels and Kerr coefficients. Equations 3.9 and 3.10 can be 
combined. 
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n^n = 


+ 7 
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— I 1 + 7er- 
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l + 7n. 




+ S6r 


+ sn. 






+ .. 
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- 1/2 
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(3.12) 

(3.13) 

(3.14) 


The quantity of Eq. 3.14 in brackets can be approximated using the bino¬ 
mial expansion and keeping only the first terms. 


l + 7n^ 


^ + sn^ ^ 


2 \ - 1/2 

+ ... 


1 - 27n^ 




-sn^ 
2 ^ 




(3.16) 

Einally, the overall index of refraction can be written as a polynomial ex¬ 
pansion of the strength of the external electric field [10, p. 698]. 


iix ( 1 - ^O'lix 




2 ' 511:1 




(3.16) 


The Pockels electro-optic effect is called the linear electro-optic effect while 
the Kerr effect is called the quadratic effect due to the form of the equation 
above. 


3.3.2 Electro-Optic Effect in Crystalline Materials 

As with the piezoelectric effect, we can determine which crystalline insulat¬ 
ing materials will exhibit the Pockels effect by looking at the symmetries 
of the material. To determine if a crystal can show the Pockels effect, de¬ 
termine the crystal structure, identify the symmetries, and determine its 
crystal point group. The Pockels effect occurs in noncentrosymmetric ma¬ 
terials, materials with a crystal structure with no inversion symmetry. Of 
the 32 crystal point groups, 21 of these groups may exhibit the Pockels 
electro-optic effect. Eor materials in these crystal point groups, and 
the Pockels coefficient 7 are nonzero. These 21 groups are also the piezo¬ 
electric crystal point groups [10, ch. 18], and they are fisted in Table 2.2. 
In some crystalline materials which belong to these crystal point groups, 
the Pockels effect is nonzero but too small to be measurable. 

Erom Table 2.2 we can see that all materials that are piezoelectric are 
also Pockels electro-optic and vice versa. Also, all materials that are pyro¬ 
electric are piezoelectric but not the other way around. Thus, if a device is 
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used as an electro-optic device, and the device is accidentally mechanically 
stressed or vibrated, the material polarization will be induced by piezo¬ 
electricity. In many devices, these effects simultaneously occur, and it can 
be difficult to identify the primary cause of a material polarization when 
multiple effects simultaneously occur. 

Tables of Pockels electro-optic coefficients for crystals can be found in 
reference [27] and [42]. 

The Kerr electro-optic effect can occur in crystals whether or not they 
belong to a crystal point group which has a center of symmetry, so some 
materials exhibit the Kerr effect but not the Pockels effect. In many ma¬ 
terials, the Kerr effect is quite small. 

3.3.3 Electro-Optic Effect in Amorphous and Polycrystalline Ma¬ 
terials 

Table 2.2 only applies to crystalline materials because only crystalline ma¬ 
terials have a specific crystal structure and can be classified into to a crystal 
point group. However, crystalline, polycrystalline, and amorphous mate¬ 
rials can all be electro-optic. In amorphous and polycrystalline materials, 
the electro-optic effect is necessarily nonlinear. When an external electric 
field, for example from a laser, is applied, a material polarization devel¬ 
ops. The charge separation in that region induces a material polarization 
in nearby atoms. Just as materials can be ferroelectric piezoelectric and 
ferroelectric pyroelectric, amorphous and polycrystalline materials can be 
ferroelectric electro-optic. 

3.3.4 Applications of Electro-Optics 

Some controllable optical devices are made from electro-optic materials. 
Examples of such devices include controllable lenses, prisms, phase mod¬ 
ulators, switches, and couplers [10]. Operation of these devices typically 
involves two laser beams. One of these beams controls the material polar¬ 
ization of the device. The intensity, phase, or electromagnetic polarization 
of the second optical beam is altered as it travels through the device [10, 
p. 698-700]. Combinations of these electro-optic devices are used to make 
controllable optical logic gates and interconnects for optical computing ap¬ 
plications [10, ch. 21] [31, ch. 20]. 

Most memory devices are not made from electro-optic materials, but 
some creative memory device designs involve electro-optic materials. Eor 
example, electro-optic materials are used for some rewritable memory [10, 
p. 712] [27, p. 534] and for hologram storage [10, ch. 21] [27, ch. 20]. 



3 PYROELECTRICS AND ELECTRO-OPTICS 


61 


Also, electro-optic materials are used in liquid crystal displays [10, ch. 18]. 
Liquid crystals are electro-optic materials because an external voltage alters 
their material polarization [10, ch. 18]. 

Electro-optic materials are also used to convert an optical beam at one 
frequency to an optical beam at a different frequency. Second harmonic 
generation involves converting an optical beam with photons of energy E 
to a beam with photons at energy [10, ch. 19] [27, ch. 18] [31, ch. 16]. 
Electro-optic materials are used in the second harmonic generation process 
as well as in the related processes of third harmonic generation, three wave 
mixing, four wave mixing, optical parametric oscillation, and stimulated 
Raman scattering [10, ch. 19]. 

3.4 Notation Quagmire 

This text attempts to use notation consistent with the literature. However, 
consistency is a challenge because every author seems to have a different 
name for the same physical phenomena. Eurthermore, the same term used 
by different authors may have completely different meanings. Eor example, 
as described by Eq. 2.19, in some materials, a mechanical stress induces a 
material polarization proportional to the square of that stress. This text 
calls this phenomenon piezoelectricity, or to be more specific, quadratic 
piezoelectricity. However, references [3] and [6] call this phenomenon elec- 
trostriction. To make matters worse, reference [33] calls this effect ferro- 
electricity. Some authors make different assumptions when using terms too. 
Eor example, when reference [26] uses the term ferroelectricity, it assumes 
crystalline materials, but it makes no assumptions about whether the effect 
is linear or not. 

Table 3.2 summarizes the notation used in this text to describe energy 
conversion processes involving material polarization. The first column lists 
the name used here to describe the effect. The second column lists what 
effect causes a material polarization. The third column describes whether 
the effect occurs in crystals only. The fourth column describes whether the 
material polarization varies linearly or not with the parameter described 
in the second column. The next column lists references which call this 
effect ferroelectricity. The last column gives names used by other references 
to describe this particular phenomenon. The last two columns are quite 
incomplete because a thorough literature survey was not done. However, 
these columns show quite a variety to the terminology even for the small 
fraction of the literature reviewed. 

You might think that you can avoid confusion of terminology by looking 
for Greek or Latin roots. While many of the terms introduced in the preced- 
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Table 3.2: Terminology related to processes involving material polarization. 
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ing chapters do have etymological roots, looking at the roots of the words 
does not help and sometimes makes matters worse. As discussed above, 
the prefix ferro- means iron. However, the ferroelectric effect has nothing 
to do with iron, and ferroelectric materials rarely contain iron. This name 
is an analogy to ferromagnetics. Some forms of iron are ferromagnetic. In 
ferromagnetics, an external magnetic field changes the permeability of a 
material. In ferroelectrics, an external electric field influences the permit¬ 
tivity. To make matters worse, iron has the periodic table symbol Ee while 
iridium has the symbol Ir. In this text, the term pyroelectric effect follows 
Roentengen’s terminology which dates 1914 [3]. The root pyro-, showing 
up in pyroelectricity, also shows up in pyrite and pyrrhotite which are iron 
containing compounds. 

Sometimes the terms phase change and photodarkening are applied to 
the electro-optic effect in amorphous materials, but not crystalline mate¬ 
rials. More specifically, sulfides, selenides, and tellurides, referred to as 
chalcogenides, are sometimes called phase change materials. Examples in¬ 
clude GeAsS, GelnSe, and so on. The word chalcogenide is itself a mis¬ 
nomer. The prefix chalc- comes from the Greek root meaning copper [24]. 
They are named in analogy to GuS, chalcosulfide. The name phase change 
material was popularized by a company that made GDs and battery com¬ 
ponents. While crystalline materials can also be electro-optic, the name 
phase change is not typically applied to crystals. 

Sometimes the terminology used in the literature can be quite different 
from the terminology of this text. Eor example, reference [44] describes 
material polarization in chalcogenide glasses by saying that when exposed 
to external optical electric fields, a material stores energy by “a transient 
exciton which can be visualized as a transient intimate valence alternation 
defect pair.” ... “This means essentially that macroscopic anisotropies 
result from geminate recombination of electron-hole pairs, which do not 
diffuse out of the microscopic entity in which they were created by absorbed 
photons.” An exciton is a bound electron-hole pair. In other words, 
the material polarizes. When an external optical electric field is applied, 
electric dipoles form throughout the material. When reading the literature 
related to piezoelectricity, pyroelectricity, and electro-optics, be aware that 
there is not much consistency in the terminology used. 
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3.5 Problems 


3.1. For each of the three crystalline materials below 

• Find the crystal point group to which it belongs. 

(Hint: use http://www.mindat.org ) 

• Using Table 2.2, determine whether or not the material is piezo¬ 
electric. 

• Using Table 2.2, determine whether or not the material is pyro¬ 
electric. 

• Using Table 2.2, determine whether or not the material is Pock- 
els electro-optic. 

(a) ZnS (sphalerite) 

(b) HgS (cinnabar) 

(c) Diamond 

3.2. Cane sugar, also called saccharose, has chemical composition 

C 12 H 22 O 11 and belongs to the crystal point group given by 2 in 
Hermann-Maguin notation [38]. Reference [38] lists values specified 
in cgse units for its piezoelectric constant as 10.2 ■ 10“® its 

pyroelectric coefficient as 0.53 Convert these values to the 

SI units of Y and respectively. 

Hint: The electrostatic unit or statcoulomb is a measure of charge [7] 
where 

1 esu = 1 state = 3.335641 ■ 10"^° C 
and the dyne is a measure of force where 1 dyne = 10“^ N. 

3.3. A material has relative permittivity e,. x when no external electric 
field is applied. The coefficient is measured in the presence of 
an external electric field of strength |)^|. Assume that and all 
higher order coefficients are zero. Find the Pockels coefficient 7 as a 
function of the known quantities x, and |^|. 
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3.4. The first figure below shows the displacement flux density 




a function of the strength of an applied electric field intensity 


in a non-electro-optic materia! 


as 




displacement flux density 




applied electric field intensity 


The second figure below shows the 
as a function of the strength of an 




terial. The solid line corresponds 


in a ferroelectric electro-optic ma- 
0 an unpoled material. The dotted 


line corresponds to the material after it has been poled in the di¬ 
rection, and the dashed line corresponds to the material after it has 
been poled in the —az direction. 


(a) Eor the non-electro-optic material, find the relative permittivity, 
e^. Also find the magnitude of the material polarization, i^. 


(b) Assume the ferroelectric electro-optic material is poled by a 
strong external electric field, and then the field is removed. Eind 


the magnitude of the material polarization 
nal field is removed. 




after the exter- 


(c) Assume the ferroelectric material is poled in the —az direction by 
a strong external field, and then the field is removed. A different 
external electric field given by ^ = 100a2 is applied. Eind 
the approximate relative permittivity of the material. 



in V/ni 



66 


3.5 Problems 



3.5. A crystalline material is both piezoelectric and pyroelectric. When 


an external electric field of |i^| = 100 ^ is applied, the material 
polarization is determined to be |7^| = 1500eo When both a 


stress of 


1^ = 30 ^ and an external electric field of |^| 


= 100 ^ 


are applied, the material polarization is determined to be |^| = 
6.0123 ■ 10“® When a temperature gradient of AT = 50 °C, a 


stress of 


1 ^ 


m^ 
= 30 


m 


, and an external electric field of |^| 


= 100 ^ 


are applied, the material polarization is determined to be |^| 


6.3 ■ 10 


-6 C 


m^ 


Find: 


• The relative permittivity of the material 

• The piezoelectric strain constant 

• The magnitude of the pyroelectric coefficient 
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4 Antennas 

4.1 Introduction 

In the previous two chapters we discussed energy conversion devices which 
are made from insulators and which are related to capacitors. In Chapters 
4 and 5 we discuss energy conversion devices involving conductors and 
related to inductors. Maxwell’s equations say that time varying electric 
fields induce magnetic fields and time varying magnetic fields induce electric 
fields. If a permanent magnet moves near a coil of wire, the time varying 
magnetic field will induce a current in the coil of wire. This idea is the 
basis behind motors and electrical generators, which are some of the most 
common energy conversion devices. However, they are outside the scope 
of this text because they involve magnets and coils. Instead, we will study 
two other types of energy conversion devices based on this same principle. 
In this chapter we discuss antennas, and in the next chapter we will discuss 
Hall effect devices. 

Antennas are energy conversion devices that convert between electrical 
energy and electromagnetic energy. Antennas can act as both transmitters 
and receivers. Transmitters convert electrical energy of the flow of elec¬ 
trons to energy of electromagnetic waves. Receivers convert energy from 
electromagnetic waves to the electrical energy of electrons in a circuit. The 
same physical antenna can operate in both ways depending on how it is 
used. 

Antennas are all around us. Cell phones and laptops have antennas, 
and antennas are mounted on the roofs of most cars. Antennas relay infor¬ 
mation about the electrical grid to the local power utility, and antennas on 
satellites transmit weather maps to weather stations on earth. Antennas 
are even built into RFID tags on shirts in stores, and these tags are used 
to track inventory and prevent theft. 

Electrical engineers study both electrical energy and electromagnetic 
energy, and the words used to describe these phenomena are similar. Is this 
really an energy conversion process? The answer is yes. Electrical energy 
involves the flow of electrons through a wire. We often think of electrons as 
particles. We often use the term electromagnetic wave to describe the flow 
of electromagnetic energy transmitted by an antenna. However, electrons 
have both wave-like and particle-like properties. Similarly, electromagnetic 
waves have both wave-like and particle-like properties. The wavelengths 
involved are orders of magnitude apart, so it is convenient to only discuss 
either the wave-like or the particle-like properties. There are fundamental 
differences between electricity and electromagnetic waves. Fermions are 
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Center-fed half-wave dipole: 


Quarter-wave monopole: 


Transmission 

Line 



Transmission 

Line 




Conducting 

plane 


Figure 4.1: Center-fed half-wave dipole and quarter-wave monopole anten¬ 
nas. 


elementary particles with half integer spin quantum numbers and with 
quantum mechanical wave functions which are antisymmetric when two 
particles are interchanged [46, p. 391]. Bosons are elementary particles 
with integer spin quantum numbers and with wave functions which are 
symmetric when two particles are interchanged [46, p. 391]. Electrons 
are fermions while electromagnetic waves are bosons. So, antennas are 
energy conversion devices. A complete discussion of the differences between 
fermions and bosons requires the study of quantum mechanics and quantum 
field theory which are beyond the scope of this book. 

An antenna may be as simple as a single metal rod, it may be a copper 
trace on a printed circuit board, it may be a cone shaped horn, or it may be 
a complicated arrangement of multiple wires. Some antennas even resemble 
planar or volume fractals [47] [48]. Hundreds of types of antennas have 
been developed. Seventy five types are discussed in [49], and 91 types are 
discussed in [50]. 

The simplest antenna is just a piece of wire. It may be straight and 
taut, or it may be carelessly strung from a tree. For an antenna designed to 
operate at wavelength A, the length of the antenna is often approximately 

A straight antenna of length | with signal supplied to the center is 
called a center-fed half-wave dipole or a | dipole. Some antennas are placed 
above a conducting plate, or above a conductive surface, which acts as a 
reflector. A straight antenna of around length | supplied by a signal at 
one end with a reflector beneath is called a quarter-wave monopole or a 
I monopole. Figure 4.1 illustrates both dipole and monopole antennas. 
While a random wire will act as an antenna, an antenna with frequency 
response, impedance, radiation pattern, and electromagnetic polarization 
designed for the specific application will perform much more efficiently, and 
these factors are discussed below in Sec. 4.4. 
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4.2 Electromagnetic Radiation 

4.2.1 Superposition 


The physics of antenna operation is described by Maxwell’s equations. Am¬ 
pere’s law, one of Maxwell’s equations, was introduced in Section 1.6.1. 




di 


(4.1) 


In Eq. 4.1, is the magnetic field intensity in is the displacement 


fiux density in and J is the current density in In the case of 
a transmitting antenna, the current density in the antenna comes from a 
known source, and the electromagnetic field, described by and , can 
be derived. 

Using Maxwell’s equations, we can algebraically derive the electromag¬ 
netic field only for very simple antennas. The simplest antenna is an in¬ 
finitesimal dipole antenna, also known as a Hertzian dipole. References 
[11] derives the electric field intensity, in units for an infinitesimal 
dipole antenna with length dl and sinusoidal current Iq cos{ut). The result 
is given in spherical coordinates is 


, and is the current density in 


A 




27o•(i^cos 9 

47reLO 

I Ip-dl-sm 6 
' 4:7reu) 


sin(a;t-^r) ^cos(ijt-^r) 
::3 I" Z2 


^cos(ujt-^r) sm(ujt-^r) 

.V.3 “T ^2 


(4.2) 


0-0 ■ 


In this expression, lo is frequency in 1^, A is the wavelength in meters, 

e is the permittivity of the material surrounding the antenna in jjj, and 
(r, 6*, 0) are the coordinates of a point specified in spherical coordinates. 

Eor complicated antennas, superposition is used to make the computa¬ 
tion feasible. To derive the electromagnetic radiation from a complicated 
antenna, small straight antenna segments are considered [15, ch. 10]. The 
electromagnetic radiation from each piece is found, and the prineiple of 
superposition is the idea that the radiation from the entire antenna is the 
sum of these pieces. The same idea applies to linear circuits. If a circuit has 
a complicated input, the input can be broken up into simpler components. 
Any voltage in the circuit can be found by finding the contribution due to 
each of these components then summing. 


4.2.2 Reciprocity 

Reeiproeity is the idea that the behavior of an antenna as a function of 
angle is the same regardless of whether the antenna is being used to send 
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Receiver 

Area 


Figure 4.2: Illustration of power radiating from an isotropic antenna. 

or receive a signal [15, ch. 10]. A plot of the strength of the field radiated 
from a transmitter as a function of the angles 9 and 0 is called a radiation 
pattern plot. Similarly, a plot of the strength of the signal received by 
a receiving antenna as a function of angles 9 and 0 assuming a uniform 
field strength is also called a radiation pattern plot. Consider two identical 
antennas, one being used as a transmitter and the other as a receiver. The 
radiation pattern plots will be the same for these two antennas. 

Regardless of the idea of reciprocity, it is often a bad idea to swap the 
transmitting and the receiving antennas of a system because a transmitter 
may be designed to handle much more power than a receiver [15, p. 479]. A 
receiving antenna of effective area A at a distance r from an antenna which 
transmits uniformly in all directions receives at most only the fraction 
of the transmitted power [49, p. 4]. 

P - P 

47rr^ 

For example, consider an antenna that transmits 20 kW of power uni¬ 
formly in all directions. Assume a receiving antenna has an effective area 
of 10 cm^ and covers a portion of a spherical shell as shown in Fig. 4.2. 
What is the power received assuming that the antenna is at a distance of 
r = 1 m, and what is the power received assuming a distance of r = 1 km? 

The surface area intercepted by the receiver is 10 cm^ = 10“^ m^. In 
the first case, this surface area is the fraction of the surface sphere of 
radius 1 m. At most, the antenna can receive this fraction of the power. 

10-3 

P = 20 ■ 10=^-r = 

47r- 12 


(4.3) 


1.6 W 


(4.4) 
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In the second case, this surface area is the fraction of the surface 

of the sphere of radius 1 km. At most, this antenna can receive 

in-3 

P = 20 ■ 10^-2 = 1.6 /iW. (4.5) 

47r ■ (10^) 

From this example, we can already see some of the advantages and chal¬ 
lenges in using electromagnetic waves for communication, and we can see 
some of the consequences of antenna design. The transmitted power in this 
example is orders of magnitude larger than the received power. In such a 
situation, the transmitting circuitry and receiving circuitry will look very 
different due to the amount of power and current expected during opera¬ 
tion. The antennas used will likely also look very different. An antenna 
transmitting kilowatts of power may need to be mounted on a tower while 
a receiving antenna that receives milliwatts of power may be built into a 
portable hand held device. 

A typical radio station may want to transmit throughout a city, a radius 
much larger than 1 km. Furthermore, no energy conversion device is 100% 
efficient. The electrical power at the receiver 1 m away is therefore going to 
be less than 1.6 W, and the power at the receiver 1 km away is going to be 
less than 1.6 fxW. Also, all radio receivers are limited by noise. Suppose, for 
example, that this transmitter is placed at the center of a city of radius 1 
km and the receiver can only successfully receive signals with power above 
1 fj,W due to 1 fj,W of background noise. A receiver placed 1 km away at 
the edge of the city may be able to receive the signal successfully, while 
a receiver further away in the suburbs may not. However, many receivers 
placed 1 km away with this surface area of 10 cm^ could simultaneously 
detect the radio signal. 

If no building in the city is taller than 10 stories, no receivers are likely 
to be found at a height over 30 m, for example, above the surface of the 
earth. However, the transmitter in this example radiates power uniformly 
in all directions including up. We can design antennas which radiate power 
in some directions more than others. If we could focus all power from this 
antenna at altitudes below 30 m, the power at a particular receiver may 
be larger than we calculated above, so a receiver farther away may be able 
to detect the signal. The radiation pattern of an antenna is the spatial 
distribution of the power from the antenna. Radiation pattern plots are 
discussed further in Sec. 4.4.3. 

This example also provides some insights on the safety of working with 
antennas. The 10 cm^ surface area in this example is, to an order of 
magnitude, the surface area of a human hand. A typical microwave oven 
uses less power than the transmitter in this example. Kilowatts of power 
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are enough to cook with, so for this reason, it would be dangerous to touch 
or even, depending on the frequency, be close to the transmitting antenna. 
The antenna in this example needs to be mounted on an antenna tower 
not only for mechanical reasons but also for safety reasons. The amount 
of power through this surface depends on distance from the transmitter as 
so the danger level is strongly dependent on distance from the antenna. 


4,2,3 Near Field and Far Field 

The region within about a wavelength of an antenna is called the near field 
region. The region beyond multiple wavelengths from an antenna is called 
the far field or Fraunhofer region. For aperture antennas, instead of wire 

, 2 (aperture size)^ . , i r 

antennas, distances larger than - — are considered m the tar 

field [15, p. 498]. The radiation pattern in the near field region and in 
the far field region are quite different. Near field electromagnetic radiation 
is used for some specialized applications including tomographic imaging of 
very small objects [51]. However, receiving antennas used for communica¬ 
tion signals almost always operate in the far field region from transmitting 
antennas. As an example of the difference between near field and far field 
behavior of an antenna, consider the infinitesimal dipole antenna. The 
electric field intensity is given in Eq. 4.2. The near field electric field from 
this infinitesimal antenna is found by taking the limit as r —)■ 0. 




If) ■ dl ■ cos 6 
ineu 


sin (ut — 


(fidr + dg) 


(4.6) 


The far field electric field is found by taking the limit as r —)■ cx). 


^ _ -Iquj ■ dl ■ sin 9 sin [ut - ^r) 
47re r 


ag 


(4.7) 


4,2,4 Environmental Eflfects on Antennas 

The electromagnetic radiation from an antenna is affected by the envi¬ 
ronment surrounding the antenna, specifically nearby large conductors. 
Sometimes conductors are purposely placed nearby to make an antenna 
directional. Other times, the conductors, like metal roofs or bridges, just 
happen to be nearby. If an antenna is placed near a salty lake, the lake 
surface will reflect the electromagnetic radiation. In other cases, the electri¬ 
cal properties of soil underneath an antenna will affect the electromagnetic 
radiation [50, ch. 8] [15, p. 635]. 



4 ANTENNAS 


73 


Environment 

Conductivity a in 

Relative permittivity e,. 

Industrial city 

0.001 

5 

Sand 

0.002 

10 

Rich soil 

0.01 

14 

Fresh water 

0.001 

80 

Salt water 

5 

80 


Table 4.1: Conductivity and relative permittivity of different environments, 
[50, ch. 8], 

Numerical simulations are used to understand how an antenna behaves 
near metal roofs, nearby lakes, or other objects. The effects of the environ¬ 
ment are modeled by assigning nearby materials an electrical conductivity 
a, permittivity e, and permeability /i. Often the surroundings have fx ~ ixq, 
but the other parameters can vary widely. Table 4.1 lists values of electrical 
conductivity and relative permittivity used to model different environments 
as suggested by reference [50, ch. 8]. The values listed are approximates 
due to the variety of environments within each category. Additionally, the 
conductivity can vary from day to day. For example, electromagnetic waves 
may interact with farmland very differently on a snowy winter day, after a 
spring rainfall, and during a dry spell in summer. Also, even for a single 
uniform material, conductivity and permittivity are functions of frequency. 

4.3 Antenna Components and Definitions 

Antennas used for radio frequency communication are made from conduct¬ 
ing wire elements. These elements may be classified as driven or parasitic 
[50]. All antennas have at least one driven element. In a transmitting an¬ 
tenna, power is supplied to the driven element. Current flowing through 
the antenna induces an electromagnetic field around the antenna. In a re¬ 
ceiving antenna, the driven element is connected to the receiving circuitry. 
Some antennas also have parasitie elements. These elements affect the an¬ 
tenna’s radiation pattern, but they are not connected to the power supply 
or receiving circuitry [50]. The electric field inside a perfect conductor is 
zero, so putting a good conductor near an antenna influences the antenna’s 
radiation pattern. Parasitic elements may be included in the antenna to fo¬ 
cus the electromagnetic field in a particular direction, alter the bandwidth 
of the antenna, or for other reasons. Antennas are often mounted on a 
metal rod for mechanical support, and this rod is called a boom. 
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4.3 Antenna Components and Definitions 


Antennas may be used individually or as part of an array. Arrays may 
also be driven or parasitic. In a driven array, all elements are connected 
to the power supply or receiving circuitry [50]. In a parasitic array, one or 
more of the elements are parasitic and not connected [50]. Arrays are also 
classified based on the direction of radiation compared to the axis of the 
array. In a broadside array, radiation is mostly perpendicular to the axis of 
the array while in an end fire array, radiation is mostly along the direction 
of the axis of the array [50]. 

A transmission line is a pair of conductors which is used to transmit 
a signal and which is very long compared to the wavelength of the signal 
being sent. Communications engineers and power systems engineers both 
use the term transmission line, but they make different assumptions. To 
a communications engineer, it is a long pair of conductors over which a 
signal is sent. To a power systems engineer, it is a cable that is part of the 
power grid. The communications definition will be used in this text. The 
conductors of a transmission line may be a pair of parallel wires, they may 
be a waveguide formed by a pair of parallel plates, they may be a coax 
cable, or they may have another geometry. Coax cable is formed by a wire 
and cylindrical tube separated by an insulator, both with the same axis, 
so they are coaxial. For example, a coax cable connecting a transmitter 
operating at a frequency of / = 88 MHz on the first fioor of a building and 
an antenna on the top of the tenth fioor of the building is a transmission 
line because the length of the cable is long compared to the wavelength of 
A = 3.4 m. As another example, a pair of wires connecting a transmitting 
circuit operating at / = 4 GHz on one end of a printed circuit board and 
an antenna on the other end 25 cm away is also a transmission line because 
the length of the wires is long compared to the wavelength of A = 7.5 cm. 

Some antennas have a balun. Baiun is a contraction for balanced/ un¬ 
balanced. It is used between balanced loads and unbalanced transmission 
lines [15, p. 406] [50]. A typical transmission line, made up of a coax 
cable, is constructed from an inner conductor and an outer conductor. 
These conductors have different radii, so they have different impedances. 
The transmission line is called unbalanced due to this impedance differ¬ 
ence. Suppose that this transmission line is connected to a dipole antenna 
formed from two symmetric conductors. The impedance of the two arms 
of the dipole are equal, so we say that it is a balanced load. A balun can 
used in this type of situation when a balanced antenna is connected to an 
unbalanced transmission line. By properly choosing the impedance of a 
balun, refiections at the interface between the antenna and transmission 
line can be reduced so that more energy gets to or from the antenna and 
less remains stored in the transmission line. 
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Frequency 

Abbreviation 

Name 

30-3000 Hz 

ELF 

Extremely Low Frequency 

3-30 kHz 

VLF 

Very Low Frequency 

30-300 kHz 

LF 

Low Frequency 

300 kHz -3 MHz 

ME 

Medium Frequency 

3-30 MHz 

HE 

High Frequency 

30-300 MHz 

VHF 

Very High Frequency 

300 MHz-3 GHz 

UHF 

Ultra High Frequency 

3-30 GHz 

SHF 

Super High Frequency 

30-300 GHz 

EHF 

Extremely High Frequency 


Table 4.2: Names of electromagnetic frequency ranges [15] [54], 


4.4 Antenna Characteristics 

Four main factors which differentiate antennas are frequency response, 
impedance, directivity, and electromagnetic polarization. When selecting 
an antenna for a particular application, these factors should be considered. 
In this section, these and other factors which influence antenna selection 
are discussed. 

4.4.1 Frequency and Bandwidth 

Electromagnetic waves of a wide range of frequencies are used for commu¬ 
nication. Different names are given to electromagnetic signals at different 
frequency ranges. Table 4.2 lists the name used to refer to various frequency 
bands for which antennas are used. 

Electromagnetic waves are rarely used for communication at the lowest 
frequency band listed in Table 4.2. However, one example was Project 
ELF (short for Extremely Low Frequency). It was a US military radio 
system used to communicate with submarines, and it operated at 76 Hz 
[52]. The array involved 84 miles of antennas spread out near a transmitting 
facilities in northern Wisconsin and the upper peninsula of Michigan [52], 
and it operated from 1988 to 2004 [53]. It had an input power of 2.3 MW, 
but only 2.3 W of electromagnetic radiation was transmitted due to the 
fact that the length of the antenna elements used was a small fraction of 
the wavelength. The few watts transmitted were able to reach submarines 
under the ocean throughout the world [52]. Three letter messages took 
15-20 minutes to transmit or receive [52]. 

Antennas are commonly used to transmit and receive electromagnetic 
radiation in the frequency range from 3 kHz < / < 3 THz. However, an 
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4.4 Antenna Characteristics 


antenna designed to operate at 3 kHz looks quite different from an antenna 
designed to operate at 3 THz. Wire-like antennas are used for signals 
roughly in the frequency range 3 kHz < / < 3 GHz. Solid cone, plate¬ 
like, or aperture antennas are used to transmit and receive signals in the 
frequency range 3 GHz < / < 3 THz [15, ch. 15]. To understand the need 
for different techniques, consider the wavelengths involved. A signal with 
frequency / = 30 kHz, for example, has a wavelength A = 1.00 ■ 10^ m. 
The length of an antenna is often of the same order of magnitude as the 
wavelength. While we can construct wire antennas of this length, they not 
portable. As another example, a wifi signal which operates at 2.5 GHz 
has a wavelength of A = 12.5 cm. Wire antennas which are this length are 
easy to build and transport. However, wire antennas designed for signals at 
higher frequencies can be difficult to construct accurately due to their small 
size. For this reason, wire antennas are typically used at lower frequencies 
while cone or plate-like antennas are used higher frequencies. 

A human eye can detect electromagnetic radiation with frequencies and 
wavelengths in the range 

4.6 ■ 10^^ Hz < / < 7.5 ■ 10^^ Hz or 400 nm < A < 650 nm 

Antennas are not used to receive and transmit optical signals due to the 
small wavelengths involved even though optical signals obey the same fun¬ 
damental physics as radio frequency electromagnetic radiation. Green light, 
for example, has a wavelength near A = 500 nm and a frequency near 6-10^^ 
Hz. An antenna designed to transmit and detect this light would need to be 
approximately of length | ^ 250 nm. An atom is around 0.1 nm in length, 
so an antenna designed for green light would be only approximately 2500 
atoms long. Antennas of this size would be impractical for many reasons. 
Another reason that different techniques are needed to transmit and re¬ 
ceive optical signals is that electrical circuits cannot operate at the speed 
of optical frequencies. Techniques for transmitting and detecting optical 
signals are discussed in Ghapters 6 and 7. 

When selecting an antenna, the range of frequencies that will be trans¬ 
mitted or received as well as their bandwidth should be considered. Some 
antennas are designed to operate over a narrow range of frequencies while 
other antennas are designed to operate over a broader band of frequencies. 
An antenna with a narrow bandwidth would be useful in the case when an 
antenna is used to receive signals only in a specific frequency band while 
an antenna with a broad bandwidth would be useful when an antenna is 
to receive signals over a wider frequency range. For example, an antenna 
designed to receive over the air television signals in the US should be de¬ 
signed for the broad range from 30 MHz - 3 G Hz because television signals 
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fall in the VHP and UHF ranges. 

Like all sensors, antennas detect both signal and noise. Noise in a radio 
receiver may be internal to the receiving circuitry or due to external sources 
such as other nearby transmitters [49, p. 4]. An antenna with a broad 
bandwidth will receive more noise due external sources than an antenna 
with a narrow bandwidth. Noise characteristics of an antenna influence the 
ability to receive weak signals, so they should be considered in selecting an 
antenna for an application [50]. 

4.4.2 Impedance 

Both antennas and transmission lines have a characteristic impedance. The 
term transmission line is deflned in Sec. 4.3 as a long pair of conductors. If 
the length of the conductors is long compared to the wavelength of signal 
transmitted, the voltage and current may vary along the length of the line, 
and energy may be stored in the line. For this reason, transmission lines 
are described by a characteristic impedance in ohms. The characteristic 
impedance gives the ratio of voltage to current along the line, and it pro¬ 
vides information on the ability of the transmission line to store energy 
in the electric and magnetic held. Typical values for the impedance of 
transmission lines used for communications are 50 or 75 Similarly, each 
antenna has its own characteristic impedance, measured in ohms, which 
represents the ratio of voltage to current in the antenna. 

Why is the impedance important? Transmitting antennas are often 
physically removed from the signal source and connected by a transmission 
line. Similarly, receiving antennas are often in a different location than 
receiving circuitry and connected by a transmission line. To efficiently 
transmit a signal between transmitting or receiving circuitry and an an¬ 
tenna, the impedance between the antenna and transmission line should 
be matched. In this case, where the characteristic impedance of the line 
and antenna are equal, energy flows along the transmission line between 
the circuitry and the antenna. Transmission lines are made from good, but 
not perfect, conductors. A small amount of energy may be converted to 
heat due to the resistance in the lines, but this amount of energy is often 
trivial. However, if there is an impedance mismatch between the antenna 
and the transmission line, reflections will be set up at the transmission line 
antenna interface. Less energy will be transmitted to or from the antenna 
because energy will be stored in the line, and the amount of energy involved 
may be significant. In a properly designed system were the impedances of 
the antenna and the transmission line are matched, no reflection occurs, so 
as much energy as possible is transmitted to or from the antenna. 
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4.4 Antenna Characteristics 


Impedance of an antenna is a function of frequency. Antennas transmit 
and receive communications signals which are almost never sinusoids of a 
single frequency. Often, however, the signals contain only components with 
frequencies within a narrow band. For example, a radio station may have a 
carrier frequency of 100 MHz, and it may transmit signals with frequency 
components 99.99 MHz </< 100.01 MHz. In this case, the impedance of 
the antenna may be approximated by the impedance at 100 MHz. 


4.4.3 Directivity 

Antennas can be designed to radiate energy equally in all directions. Al¬ 
ternatively, antennas can be designed to radiate energy primarily along a 
single direction. Directivity D is a unitless measure of the uniformity of 
the radiation pattern plot. It is defined as the ratio of the maximum power 
density over the average power density. 

^ Maximum power density radiated by antenna . 

Average power density radiated by antenna 

An antenna which radiates equally in all directions is called isotropic. 
An antenna that radiates equally in two, but not the third, direction is 
called omnidirectional [15]. For example, an omnidirectional antenna may 
radiate equally in all horizontal directions but not the vertical direction. 
Isotropic antennas have D = 1 while all other antennas have D > 1. Some 
applications require an isotropic antenna. For example, a radio station in 
the center of a town might use an isotropic or omnidirectional antenna 
to transmit to all of the town. In other cases, a directional antenna is 
preferred. A stationary weather station that transmits data to a fixed base 
station would be wasting energy using an isotropic antenna because it could 
use less transmitted power with the same received power using a directional 
antenna. 

Received power may be larger than given by Eq. 4.3 if directional 
antennas are used instead of isotropic antennas. For a transmitting antenna 
with gain Gtrans and a receiving antenna with gain Grec compared to an 
isotropic antenna, Eq. 4.3 becomes 


Free = Ft 


trans^t^r 


where the effective area is assumed to be related to the receiver gain by 


Gr = 


( 4 , 10 ) 
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Equation 4.9 is known as the Eriis equation [55]. Received power will be 
less than given by Eq. 4.3 or 4.9 due to losses in the air or other material 
through which the signal travels and due to a difference in electromagnetic 
polarization between the transmitter and receiver [49, p. 4]. 

Directivity is a rough measure of an antenna. A more accurate mea¬ 
sure is a radiation pattern plot. The radiation pattern plot is a graphical 
representation of intensity of radiation with respect to position throughout 
space. A radiation pattern plot may be a 3D plot or a pair of 2D plots. In 
the case where two 2D plots are used, one of the plots is an azimuth plot 
and the other is an elevation plot. The azimuth plot shows a horizontal 
slice of the 3D radiation pattern, parallel to the xy plane. The elevation 
plot shows a vertical slice, perpendicular to the xy plane. Most radiation 
pattern plots, including all shown in this text, are labeled by the ampli¬ 
tude of the electric field [15] [56]. However, occasionally they are labeled by 
the amplitude of the power instead. The radiation pattern of an antenna 
is quite different in the near field, at a distance less than about a wave¬ 
length, and in the far field, with distances much greater than a wavelength. 
Radiation pattern plots illustrate the far field behavior only. 

Eigure 4.3 shows the radiation pattern plot for a half-wave dipole an¬ 
tenna in free space, and it was plotted using the software EZNEC [56]. The 
acronym NEC stands for Numerical Electromagnetics Code. The figure in 
the upper left is the azimuth plot, the figure in the upper right is the ele¬ 
vation plot, the figure in the lower left is a 3D radiation pattern plot, and 
the figure in the lower right is the antenna layout. 

Eigure 4.4 shows the radiation pattern plots for a 15-meter quad an¬ 
tenna. Distinct lobes and nulls are apparent. 

Front to baek ratio (E/B ratio) is a measure related to directivity that 
can be found from the azimuth radiation pattern plot. By definition, it is 
the ratio of the strength of the power radiated in the front to the back. 
Often, the front direction is chosen to be the direction of largest magni¬ 
tude in the radiation pattern plot, and the back direction is the opposite 
direction. E/B ratio can be specified either on a log scale in units of dB or 
on a linear scale which is unitless. It can also be defined either as a ratio 
of the strength of the electric field intensities or as a ratio of the strengths 
of the powers, but most often power is used. 


E/B ratio= 


P 


front 


P^ 


back 


= 10 log 


10 


dB 


P, 


front 


P 


back 


= 20 log 


10 


lin 




front 




back 


lin 

(4,11) 
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Figure 4.3: Radiation pattern plots for a half-wave dipole antenna. 
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Figure 4.4: Radiation pattern plots for a 15-meter quad antenna. 
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F/B ratio= 


Pfront 

— 9 


front 


p 

^ back 

dB 


^ back 



-> dB 


(4,12) 


The F/B ratio for the example of Fig. 4.4 can be calculated from the 
azimuth plot. The strength of the field in front direction is 9 dB stronger 
than the strength of the field in the back direction. 



9 dB 


(4,13) 


From this information, we can calculate the strength of the field in the 
front direction to the strength of the field on a linear scale. 
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= lOTo = 7.94 


lin 


(4,14) 


(4,16) 


(4,16) 


If this antenna is being used as a transmitter, signal in the front direction 
is 7.9 times as strong as the signal in the back direction. The front to back 
ratio specifies the power ratio, and for this antenna, it is 18 dB. 


F/B ratio= 


Pfront 

— 9 


^ front 


Pback 

dB 


^ back 



18 dB. 


(4,17) 


When selecting an antenna, many decisions related to the antenna di¬ 
rectivity are needed. A particular application may require an isotropic or 
a directional antenna. If a directional antenna is needed, the magnitude 
of the directivity must be decided. Additionally, the orientation of the 
antenna must be decided so that nodes and nulls are in the appropriate 
directions. Both the azimuth angle and the elevation angle of the nodes 
and nulls should be considered [50, p. 22-1]. 
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4.4.4 Electromagnetic Polarization 

The electromagnetic wave emanating from a transmitting antenna is de¬ 
scribed by an electric field ^ and a magnetic field The wave neces¬ 
sarily has both an electric field and a magnetic field because, according to 
Maxwell’s equations, time varying electric fields induce time varying mag¬ 
netic fields, and time varying magnetic fields induce electric fields. At any 
point in space and at any time, the direction of the electric field, the di¬ 
rection of the magnetic field, and the direction of propagation of the wave 
are all mutually perpendicular. More specifically, 

^Direction of x ^Direction of ftj = (Direction of propagation). 

(4.18) 

An electromagnetic wave which varies with position in the same way 
that it varies with time is called a plane wave because planar wavefronts 
propagate at constant velocity in a given direction. For example, a sinu¬ 
soidal plane wave which travels in the positive dz direction is described 

by 

'^ = Eq cos (lO®t — 3 OO 2 :) dx- (4.19) 

For this plane wave, ^ is directed along d^, ^ is directed along dy, and 
the wave propagates in the dz direction. As another example, consider the 
plane wave described by 


'^ = Eo cos (lO®t 



(4.20) 


For this plane wave, the direction of ^ is 45° from the dx axis, the direction 
of is 45° from the dy axis, and again it propagates in the dz direction. 
Both of these electric fields describe sinusoidal plane waves because the 
electric field varies with position as it does with time, sinusoidally in both 
cases. 

We can classify plane waves by their eleetromagnetie polarization. Plane 
waves can be classified as linearly polarized, left circularly polarized, right 
circularly polarized, left elliptically polarized, or right elliptically polarized. 
In a previous chapter, we encountered the distinctly different idea of mate¬ 
rial polarization. Appendix C discusses overloaded terminology including 
the term polarization. 

Both of the electromagnetic waves described by Eq. 4.19 and by Eq. 
4.20 are linearly polarized. In both cases, the direction of the electric field 
remains constant as the wave propagates with respect to both position and 
time. If the direction of the electric field rotates uniformly around the 
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axis formed by the direction of propagation, the wave is called circularly 
polarized. If the direction of the electric field rotates nonuniformly, the wave 
is called elliptically polarized. For circularly polarized waves, the projection 
of the wave on a plane perpendicular to the axis formed by the direction 
of propagation is circular. For elliptical waves, the projection is elliptical. 
To determine if the polarization is left or right, point your right thumb in 
the direction of propagation, and compare the rotation of the electric field 
to the rotation of your fingers. If the rotation is along the direction of the 
fingers of your right hand, the wave is right polarized. Otherwise, it is left 
polarized. For example, the wave described by 


'^ = Eq cos (lO^t — 3 OO 2 ;) —^ + Eq sin (lO®t — 300;^) —^ (4-21) 

V 2 V 2 


is right circularly polarized. As another example, the wave 


^ = F'o cos (lO^t — 3 OO 2 :) ^ + Eo sin (l0®t — 300^) —^ 
is right elliptically polarized. The wave 




(4.22) 


'^ = Eq cos (l0®t — 3 OO 2 :) —^ — Eq sin (l0®t — 300^) —^ (4.23) 

V 2 V 2 


is left circularly polarized. These definitions are illustrated in the Fig. 4.5. 

What does electromagnetic polarization have to do with antennas? An¬ 
tennas may be designed to transmit linearly, circularly, or elliptically polar¬ 
ized signals. Antennas designed to transmit or receive circularly polarized 
signals often contain wires that coil in the corresponding direction around 
an axis. If a signal is transmitted with an antenna designed to transmit lin¬ 
early polarized waves, the best antenna to use as a receiver will be one that 
is also designed for linearly polarized waves. The signal can be detected by 
an antenna designed for signal of a different electromagnetic polarization, 
but the received signal will be noisier or weaker. Similarly, if a signal is 
transmitted with an antenna designed for right circular polarization, the 
best receiving antenna to use will be one also designed for right circular 
polarization. 
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Linearly Polarized 



Right Circularly Polarized Left Circularly Polarized 





Figure 4.5: Illustration of types electromagnetic polarization for a plane 
wave traveling in the direction. 



4 ANTENNAS 


85 


4.4.5 Other Antenna Considerations 

Antennas are made from good conductors. In Chapters 2 and 3, we saw 
that the materials that make up many energy conversion devices strongly 
influence the behavior. While the conductivity of conductors vary, overall 
the material that an antenna is made from does not signiflcantly affect its 
behavior. In addition to bandwidth, impedance, directivity, and electro¬ 
magnetic polarization, other factors, such as size, shape and configuration, 
distinguish one antenna from another. Mechanical factors should be consid¬ 
ered too. An ideal antenna may be one that is easy to construct or mount 
in the desired location, is portable, or requires little maintenance [50]. If an 
antenna is to be mounted outside, the antenna should be able to withstand 
snow, wind, ice, and other extreme weather [50]. While Maxwell’s equa¬ 
tions are useful for predicting the radiation pattern of an antenna, they do 
not provide information about these other factors. 

There is no perfect antenna. In one case, the best antenna may be a 
Yagi which is very directional and designed to operate within a narrow 
frequency band. In another application, the best antenna may be me¬ 
chanically strong and mounted in a way to withstand extreme wind [50, 
p. 17-29]. In another case, the best antenna may be portable and easy 
to set up by one person regardless of its radiation pattern [50, p. 21-26]. 
In another case, the best antenna may be a wire of an arbitrary length 
hanging from a tree because it was the easiest and quickest to construct. 
As with any branch of engineering, antenna design involves trade offs. For 
example, the best antenna to detect an 800 MHz linearly polarized signal 
is an antenna that is designed to detect 800 MHz signals, is designed to 
detect linearly polarized signals, is oriented in the proper direction, and 
has an impedance matched to the impedance of the transmission line used. 
The signal can still be detected using an antenna designed for a different 
frequency, designed for a different electromagnetic polarization, improperly 
directed, or with mismatched impedance. However, in all of these cases, a 
less intense signal will be received. 
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4.5 Problems 



Figure 4.6: A snow covered dish antenna. 

4.5 Problems 


4.1. An antenna is designed to operate between 4.98 GHz and 5.02 GHz, 
for a bandwidth of A/ = 0.04 GHz. Find AA, the wavelength range 
over which the antenna is designed to operate. 

Hint: The answer is NOT 7.5 m. 

4.2. Use the figure to find the following information. (Wires connecting 


to receiver or transmitter are not shown.) 



(a) Approximate the wavelength that antenna a is designed to op¬ 
erate at. 

(b) Approximate the frequency that antenna b is designed to operate 
at. 

(c) Which antenna most likely has parasitic elements: antenna a, 
antenna b, both, or neither? Explain your choice. 
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(d) Which antenna do you expect to be more isotropic: antenna 
a, antenna b, or would they be about the same? Explain your 
choice. 

(e) Which antenna is more likely to be used as a receiver than 
a transmitter: antenna a, antenna b, or both antennas about 
equal? Explain your choice. 


4.3. Some speculate that alien civilizations might be able to watch TV 
programs that escape the earth’s atmosphere. To get an idea of the 
likelihood for this to occur, consider an isotropic antenna in outer 
space transmitting a 200 MHz TV signal. 

Assume that the alien civilization uses an antenna with surface area 
0.5 m^ and has the technology to detect a signal with power as low as 
5 ■ 10“^^ W. What is the minimum power that must be transmitted 
for detection to occur at a distance of 1.0 light year? 

4.4. Project ELE, described in Sec. 4.4.1, was an extremely low frequency, 
76 Hz, radio system set by the military to communicate with sub¬ 
marines. It had facilities near Clam Lake, Wisconsin and Republic, 
Michigan, 148 miles apart [52]. Because these facilities were located 
a fraction of a wavelength apart, antennas at these locations acted as 
part of a single array. The length of all antenna elements was 84 miles 
[52]. Assume it took 18 minutes to transmit a three letters message 
using 8 bit ASCII, and assume signals travel close to the speed of 
light in free space. 

(a) Calculate the ratio of the distance between the transmitting fa¬ 
cilities to the wavelength. 

(b) Calculate the ratio of the length of all antenna elements to the 
wavelength. 

(c) What was the speed of communication in bits per second? 

(d) How many wavelengths long were each bit? 



4.5 Problems 


4.5. Match the following plots or antenna descriptions with their azimuth 
plots. 

1. An antenna with 3D plot shown below 



2 . An isotropic antenna 

3. An antenna with nulls at ±90° 

4. An antenna with a gain of around 19dB 



4.6. Radiation pattern plots are for a particular transmitting antenna are 
shown. They were plotted with EZNEC. The azimuth plot is on the 
left, and the elevation plot is on the right. The antenna is designed 
to operate at at 360 MHz. Use the plots to answer the following 
questions. 
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z 



(a) Assume a person 100 m away and receiving signal from the an¬ 
tenna in the front direction (along the (ix axis) receives a signal 
of 15 W. Approximately how strong of a signal would the person 
receive by standing 100 m away from the transmitter along the 
hy axis (in watts)? 

(b) Find the (power) F/B ratio in dB. 

(c) According to the azimuth plot, at approximately what angle are 
the nulls for this antenna? 

(d) What wavelength is this antenna designed to operate at? 

4.7. Figure 4.4 show the radiation pattern plots for a quad antenna de¬ 
signed to operate at / = 21.2 MHz. The upper left plot shows the 
azimuth plot, the upper right plot shows the elevation plot. The 
lower left plot shows the 3D radiation pattern, and the lower right 
plot shows the antenna elements. They were plotted with EZNEC 
software. 


(a) 

(b) 


Find the wavelength the antenna is designed to operate at. 

It, 


Find 

dB. 

Find 


*' f ront 


, the field front to back ratio of the antenna in 


dB 


- front 
^back 


dB 


, the power front to back ratio in dB. 


(d) 


Find 


^front 
^back 


lin 


, the power front to back ratio on a linear scale. 


Assume the electric field intensity 50 m away measured along 
the 4> = 45° axis (in the z = 0 plane) is 5 Find the electric 
field intensity 50 m away measured along the 0 = 135° axis (in 
the z = 0 plane). 
























90 


4.5 Problems 


4.8. Radiation pattern plots for a particular transmitting antenna are 
shown. They were plotted with EZNEC. The azimuth plot is on 
the left, and the elevation plot is on the right. 

Azimuth Elevation 




(a) Is this antenna isotropic? Justify your answer. 

(b) The antenna is designed to operate at a frequency of 187 MHz. 
What is the corresponding wavelength? 

(c) Eind the (power) E/B ratio in dB. 

(d) The signal 100 m from the transmitting antenna in the front 
direction (0 = 0) is measured to be |^| = 50 What is 

the electric field strength of the signal in ^ at 100 m from the 
antenna in the 0 = 45*^ direction? 

(e) Radiation pattern plots do not apply for all distances from the 
antenna. Roughly for what distances away are the radiation 
plots valid? 

4.9. Determine if the following electromagnetic waves are linearly polar¬ 
ized, right circularly polarized, left circularly polarized, right ellipti- 
cally polarized, or left elliptically polarized. All of these waves travel 
in the direction, and w is a constants. (This is a modified version 
of P3.34 from [11].) 

(a) = 10 cos {ut — 8z) CLx + 10 sin {ut — 8z) ay 

(b) = 10 cos (cut — 8z + CLx + 10 cos (a;f — 8z -|- |) ay 

(c) = 10 cos {ut — 8z) CLx — 20 sin {ut — 8z) cty 

(d) = 10 cos {ut — 8z) CLx — 10 sin {ut — 8z) hy 
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5 Hall Effect 

5.1 Introduction 

In this chapter we discuss another type of inductive energy conversion de¬ 
vice, the Hall effect device. While these devices may be made from con¬ 
ductors, they are more often made from semiconductors, like silicon, which 
are easily integrated into microelectronics. The Hall effect was discovered 
using gold by Edwin Hall in 1879 [57]. The first practical devices were 
produced in the 1950s and 1960s when uniform semiconductor materials 
were first manufactured [57]. 

Hall effect sensors are used to measure some hard to observe quantities. 
Without external tools, humans cannot detect magnetic field. However, a 
small, inexpensive Hall effect sensor can act as a magnetometer. Also, the 
Hall effect can be used to determine if a semiconductor is n-type or p-type. 
One of the first applications of Hall effect devices was in computer keyboard 
buttons [57]. Today, Hall effect devices are used to measure the rotation 
speed of a motor, as flow rate sensors, in multiple types of automotive 
sensors, and in many other applications. 

5.2 Physics of the Hall Effect 

Hall ejfect devices are direct energy conversion devices that convert energy 
from a magnetic field to electricity. The physics behind these devices is 
described by the Lorentz force equation. This discussion follows references 
[3] and [9]. If we place a charge in an external electric field, it will feel a 
force parallel to the applied electric field. If we place a moving charge in 
an external magnetic field, it will feel a force perpendicular to the applied 
magnetic field. The Lorentz force equation 

^ = + (5.1) 

describes the forces on the moving charge due to the external electric and 
magnetic fields. In the above equation, represents force in newtons on 
a charge moving with velocity it in units The quantity ^ represents 
the electric field intensity in units and B represents the magnetic flux 

density in units -^1^. Charge in coulombs is denoted by Q. Notice that the 
force on the charge due to the electrical field points in the same direction 
as the electrical field while the force on the charge due to the magnetic field 
points perpendicularly to both the velocity of the charge and the direction 
of the magnetic field. 
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The Hall effect occurs in both conductors and semiconductors. In con¬ 
ductors, electrons are the charge carriers responsible for the effect while in 
semiconductors, both electrons and holes are the charge carriers respon¬ 
sible for the effect [9]. A hole is the absence of an electron. Consider a 
piece of semiconductor oriented as shown in Fig. 5.1a. Assume the length 
is specified by /, the width is specified w, and the thickness is specified 
by dthick- For a typical Hall effect device, these dimensions may be in the 
millimeter range. Furthermore, assume the semiconductor is p-type with 
hole concentration p in units m“^. The charge concentration represents the 
net, or excess, charge density above a neutral material. Materials with a 
net negative charge, excess valence electrons, will have a positive value for 
the electron concentration n and are called n-type. Materials with a net 
positive charge, an excess of holes, will have a positive value for the hole 
concentration p which represents the density of holes in the material and 
are called p-type. Overall charge density is related to n and p by 

Pch = -qn -k qp (5.2) 

where q is the magnitude of the charge of an electron. 

Assume the semiconductor is placed in an external magnetic field ori¬ 
ented in the direction, with magnetic flux density 

^ = B^hz. 

Also assume a current is supplied through the semiconductor in the hx 
direction. The positive charge carriers in the semiconductor, holes, move 
with velocity if = Vxhx because current is the flow of charge per unit 
time. These measures are illustrated in Fig. 5.1b. Hall effect devices are 
typically used as sensors as opposed to energy harvesting devices because 
power must be supplied from this external current and because the amount 
of electricity produced is typically quite small. 

The force on the charges can be found from the Lorentz force equation. 
The force due to the external magnetic field on a charge of magnitude q is 
given by 

qlf = qvxhx x = -qB^hy (5.3) 

and is oriented in the —hy direction. Positive charges accumulate on one 
side of the semiconductor as shown in Fig. 5.1c. This charge build up 
causes an electric field oriented in the hy direction which opposes further 
charge build up. Charges accumulate until an equilibrium is reached when 
the forces on the charges in the hy direction are zero. 

y=o=(3(i?+vxif) 


(6.4) 
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The electric field intensity can be expressed as a function of the voltage 
Vab measured across the width of the device, in the cby direction. 

w 

= —qlt X ^ (5.6) 


(5.7) 

w 

While the magnitude of the velocity of the charges Vx is often not known, 
the applied current, Ix, in units amperes, is known. The current density 
through a cross section of the device is the product of the charge concen¬ 
tration, the strength of the charges, and the velocity of the charges. 


current density = — = q ■ Vx ■ p 

w-dthick ^ ^ 


( 6 . 8 ) 


From the above expression, velocity can be expressed in terms of the cur¬ 
rent. 


Vx 


4 

VJ ■ dthick ' Q ' P 


(6.9) 


Equations 5.7 and 5.9 can be combined. 




V! ■ Ix ' Bz 
W ■ dthick -q-p 


( 6 . 10 ) 


A magnetometer is a device that measures magnetic field. To use a 
Hall effect device as a magnetometer, start with a piece of semiconductor 
of known dimensions and known charge concentration, and then apply a 
current. If the voltage perpendicular to the current is measured, the mag¬ 
netic field can be calculated. The measured voltage is proportional to the 
strength of the external magnetic flux density. 


Bz = 


dthick ■ q ■ P ■ Vab 


( 6 . 11 ) 


Voltage is easily measured with a voltmeter, so no specialized tools are 
needed. To reliably measure this voltage, it is often amplified. 

Alternatively, if the strength of an external magnetic field is known, the 
Hall effect can be used to measure the concentrations of holes or electrons 
in a piece of semiconductor. With some algebra, we can write the hole con¬ 
centration as a function of the dimensions of the semiconductor, the known 
magnetic field strength, the applied current, and the measured voltage. 

_ Ix ■ Bz 

dthick ■ q ■ Vab 


( 6 . 12 ) 
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An analogous expression can be found if electrons instead of holes are the 
dominant charge carrier. The sign of this measured voltage is also used to 
determine whether a piece of semiconductor is n-type or p-type [58]. 

The Hall resistance Rh is a parameter inversely proportional to the 
charge concentration, and it has the units of ohms [9] [59]. For the as¬ 
sumptions above, the Hall resistance is defined as 


Rh 


QP I ■ dffiick 


(6,13) 


By combining Eqs. 5.12 and 5.13, it can be written in terms of the measured 
voltage and applied current. 


Rh 


Vab w 
R ' I 


(6,14) 


As an example, suppose that a piece of silicon with a hole concentration 
of p = 10^^ cm“^ is used as a Hall effect device. The device has dimensions 
I = 1 cm, w = 0.2 cm, and dthick = 0.2 cm, and it is oriented as shown 
in Fig. 5.1. The material has a resistivity of p = 0.9 fl-cm. A current 
of / = 1 mA is applied in the dx direction. The device is in an external 
magnetic field of ^ = 10“^a^ If a voltmeter is connected as shown 

in the figure, what voltage Vab is measured? 




IxBz 

Q ■ dffiick ■ P 


1-3 . 10-3 

1.6 ■ 10-19 ■ 0.2 ■ 10^7 


3.1 ■ 10-3 V 


(5.15) 


Signals in the millivolt range are easily detected with a standard voltmeter, 
yet signals in the microvolt range often can be measured with some ampli¬ 
fication. What output power is generated by this device? We can calculate 
the resistance along the dy direction. The resistivity of the silicon was given 
in the problem, and resistance R and resistivity p are related by 


R = 


p ■ length 
area 


The resistance across the width of the device is 

pw 0.09 ■ 0.2 


R. 


-width 


Id-, 


thick 


1 ■ 0.2 


= 0.09 n 


(5.16) 


(5.17) 


We can use this calculated resistance and the measured voltage to find the 
power converted from the magnetic field to electrical power of the device. 

P =—^ = 1.1 ■ 10-33 W (5.18) 

■^width 
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This amount of power is tiny. While this device can make a useful sensor, 
it will not make a useful energy harvesting device. It generates tens of 
picowatts of power, and a 1 mA current must be supplied to generate the 
power. 

5.3 Magnetohydrodynamics 

A magnetohydrodynamic device converts magnetic energy to or from elec¬ 
trical energy through the use of a conductive liquid or plasma. Similar to 
the Hall effect, the fundamental physics of the magnetohydrodynamic effect 
is described by the Lorentz force equation, Eq. 5.1. The difference is that 
the magnetohydrodynamic effect occurs in conductive liquids or plasmas 
while the Hall effect occurs in solid conductors or solid semiconductors. An¬ 
other related effect, which is also described by the Lorentz force equation, 
is the electrohydrodynamic effect, discussed in Sec. 10.6. The difference 
is that the magnetohydrodynamic effect involves magnetic fields while the 
electrohydrodynamic effect involves electric fields. 

Matter can be found in solid, liquid, or gas state. A plasma is another 
possible state of matter. A plasma is composed of charged particles, but a 
plasma has no net charge. When a solid is heated, it melts into a liquid. 
When a liquid is heated, it evaporates into a gas. When a gas is heated, the 
particles will collide with each other so often that the gas becomes ionized. 
This ionized gas is a plasma [3]. When ions in either a conductive liquid 
or a plasma flow in the presence of a magnetic field perpendicular to the 
flow of ions, a voltage is produced. 

This magnetohydrodynamic effect was first observed by Faraday in 1831 
[3]. In the 1960s, there was interest in building magnetohydrodynamic de¬ 
vices where the conducting medium was a plasma. These devices typically 
operated at high temperatures, in the range of 3000-4000 K [60]. Progress 
was limited, however, because few materials can withstand such high tem¬ 
peratures. More recently, engineers have used this principle to build pumps, 
valves, and other devices for microfluidic systems [61] [62]. These room 
temperature devices can control the flow of conducting liquids through the 
use of an external magnetic field. 



5 HALL EFFECT 


97 


5.4 Quantum Hall Effect 

Around a hundred years after the discovery of the Hall effect, the quantum 
Hall effect was discovered. Klaus von Klitzing discovered the integer quan¬ 
tum Hall effect in 1980 and won the physics Nobel prize for it in 1985 [63]. 
In 1998, Robert Laughlin, Horst Stormer, and Daniel Tsui won the physics 
Nobel prize for the discovery of the fractional quantum Hall effect [64]. The 
integer quantum Hall effect is observed in two dimensional electron gases 
which can occur, for example, in an inversion layer at the interface between 
the semiconductor and insulator in a MOSFET [59]. As in the Hall effect, 
a current is applied in one direction, and the Hall voltage is measured in 
the perpendicular direction. Following Fig. 5.1, assume that a current is 
applied along the direction in the presence of an external magnetic field 
in the direction. The voltage Vab is measured, and Hall resistance Rh 
is calculated. The quantum Hall effect is observed at low temperatures and 
in the presence of strong applied magnetic fields. In such situations, the 
Hall resistance has the form 

Rh = ^ (5.19) 

■ n 

where h = 6.626 ■ 10“^"^ J-s is the Planck constant and n is an integer [59]. 
This effect is called the quantum Hall effect because Rh can take only dis¬ 
crete values corresponding to integer values. Values of the Hall resistance 
can be measured extremely accurately, to 2.3 parts in 10^° [59]. The frac¬ 
tional quantum Hall effect is observed in highly ordered two dimensional 
electron gases in the presence of very strong magnetic fields, and it involves 
quantum mechanical electron-electron interactions [65]. 

The formal definition of the ohm relies on definitions of the meter, kilo¬ 
gram, and second. The kilogram is defined with respect to the weight of 
a physical object made of platinum and iridium housed in the Interna¬ 
tional Bureau of Weights and Measures in France [59]. Multiple national 
labs, including the National Institute of Standards and Technology in the 
United States, have come up with an experimental means of defining the 
ohm involving the quantum Hall effect. This standardized definition of the 
ohm is accurate to one part in 10® which is more accurate than previous 
definitions involving the kilogram, meter, and second [59]. Because of the 
high accuracy with which the integer quantum Hall effect can be measured, 
scientists have proposed using experiments involving it to standardize the 
measurement of the Planck constant and the definition of the kilogram 
instead of relying on a definition involving a physical object. These new 
standards have not been adopted yet, but they may be implemented as 
early as 2019 [66]. 
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5.5 Applications of Hall Effect Devices 

A Hall effect device is a simple device. It is essentially a piece of semicon¬ 
ductor with leads connected and calibrated for use. For this reason, Hall 
effect devices are inexpensive, small, and readily available. As with most 
integrated circuits, these devices are durable and long lasting because they 
have no mechanical moving parts [57]. 

Hall effect devices are available in two types: analog and digital. Analog 
Hall effect devices are typically integrated with an amplifier and circuitry 
to make the output more linear [57]. Some devices also contain circuitry 
to make the devices stable over a wider temperature range because the 
output of Hall effect sensors may be slightly temperature dependent [57]. 
The operating output voltage range of these devices is often limited by the 
amplifier circuit as opposed to the Hall effect sensor [57]. Digital Hall effect 
devices contain the Hall effect sensor integrated with additional circuitry 
such as a comparator to produce a digital output [57]. 

Analog Hall effect devices are used to sense magnetic field, temperature, 
current, pressure, position, and other parameters [57]. To make a Hall 
effect temperature sensor, for example, a magnet is mounted on a material 
that contracts or expands in the presence of a temperature change. As 
the magnet moves, it changes the magnetic field in a nearby Hall effect 
device and thereby generates a voltage across the Hall effect device. The 
same effect can be used to measure pressure or other parameters using 
a material that expands or contracts when the pressure changes or other 
parameter changes. Current flowing through a wire generates a magnetic 
field surrounding the wire. For this reason, the Hall effect can be used to 
make an ammeter that can be mounted nearby, as opposed to in the path 
of, the current. 

Digital Hall effect devices are used as switches or as buttons in a key¬ 
board. If a small magnet is mounted in a button, a Hall effect device can 
be used to sense when that magnet is pressed down near the Hall effect 
sensor. Hall effect devices can also be used as proximity sensors to detect 
the presence of nearby ferromagnetic objects [57]. Additionally, digital Hall 
effect devices are used in magnetic card readers [57]. One of the most com¬ 
mon applications is in tachometers, devices that measure rotation speed. 
To measure the rotation speed of a motor for example, the Hall effect sen¬ 
sor is mounted near a ferromagnetic gear. See Fig. 5.2. As a gear tooth 
passes the sensor, the magnetic field at the sensor changes, and a volt¬ 
age is induced across the Hall effect device. Hall effect sensors are used 
to measure rotation speed of motors, fans, tape machines, and disk drives 
[57]. Relatedly, Hall effect devices are used as flow rate sensors. These 
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Hall effect device 



Figure 5.2: Placement of Hall effect sensor used as a tachometer. 


sensors are found in devices ranging from water softeners to ocean current 
monitors [57]. To detect flow rate, a blade is mounted so that it rotates 
in the water flow. Magnets are mounted on the blade, and the Hall effect 
sensor is mounted nearby. When the blade passes the sensor, the magnetic 
held at the sensor changes and induces a voltage in the Hall effect sensor. 
Following the same principle. Hall effect sensors are used to measure the 
speed of paper flow in copiers, needles in sewing machines, drill bits in 
drilling machines, and bottles in bottling factories [57]. 

Multiple types of Hall effect devices are used in cars. Hall effect sensors 
are used as rotation sensors to detect transmission speed [57]. They are 
used as proximity sensors to detect the shift lever position, crank shaft 
position, and throttle position [57]. They are also used in door interlocks, 
in brake skidding detection, and in traction control systems [57]. 
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5.6 Problems 


5.1. Suppose that you are using a piece of semiconductor as a Hall effect 
device to measure a magnetic field. You supply a DC current through 
the device. You would like to replace the piece of semiconductor with 
another one that will give a larger output for the same external mag¬ 
netic field. List two ways you can change the piece of semiconductor 
so that the output would increase. (Specify both the property and 
whether it would need to be increased or decreased.) 


5.2. A piece of p-type semiconductor is used as a Hall effect device. The 
device has a thickness of dthick = 1 mm. It is placed in an external 
magnetic field of |^| = 10“^ A Hall voltage of 5 /iV is measured 
when a current of 3 mA is applied. Calculate p, the charge (hole) 
concentration in units 

5.3. A Hall effect device is used to measure the strength of an external 
magnetic field. The device is oriented in the way described in Fig. 
5.1. It is made from a cube of p-type silicon with hole concentration 
5 ■ 10^^ cm“^ where the length of each side of the cube is 1 mm. A 
current of 3 mA is applied through the device. The voltage measured 
across the device is 2.4 mV. Find the strength of the external magnetic 
flux density, |^|. 


5.4. 


A Hall effect device is used to measure the strength of an external 
magnetic field. The device is oriented in the way described in Fig. 
5.1. It is made from a material of length / = 3 mm, width w = 0.5 
mm, and thickness dthick = 0.5 mm. It has a hole concentration of 
p = 10^° m“^. In an experiment, the devices was placed in an external 


magnetic field of 


= 2.5 


Wb 

m2 


and a voltage of 9 mV was measured. 


What current was used in the experiment? 


5.5. Two expressions were given for the Hall resistance: 


Rh = ^ 


and Rh = J-. 


gp ^'dthick 

Show that both expressions have the units of ohms. 
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6 Photovoltaics 

6.1 Introduction 

This chapter discusses solar cells and optical detectors, both of which are 
devices that convert optical electromagnetic energy to electricity. The next 
chapter discusses lamps, LEDs, and lasers which convert energy in the op¬ 
posite direction. The photovoltaic effect is the idea that if a light shines on 
a pure piece of semiconductor, electron-hole pairs form. In the presence of 
an external electric field, these charges are swept apart, and a voltage de¬ 
velops across the terminals of the semiconductor. It was first demonstrated 
in 1839 by Edmond Becquerel. In a photovoltaic device, also called a solar 
cell, this effect typically occurs at a semiconductor pn junction. This same 
effect occurs on a smaller scale in photodiodes used to detect light and 
in optical sensors in digital cameras. To understand the physics behind 
these devices, we need to further study crystallography in semiconductors. 
Energy level diagrams, which illustrate the energy needed to remove an 
electron from a material, are another topic studied in this chapter. 

Unlike fossil fuel based power plants, photovoltaic cells produce energy 
without contributing to pollution. The solar power industry is growing at 
a fast pace. Worldwide as of April 2017, photovoltaic cells were capable of 
generating over 303 GW of power, and 75 GW of this total was installed 
within the past year [67]. This generating capacity was sufficient to satisfy 
1.8% of the worldwide demand for electricity [67]. In the United States as 
of April 2017, photovoltaic cells installed were capable of generating 14.7 
GW [67]. 

6.2 The Wave and Particle Natures of Light 

The physics of electromagnetic radiation is described by Maxwell’s equa¬ 
tions, Eqs. 1.5 - 1.8, and discussed in Sections 1.6.1 and 4.4.1. Optical 
energy is electromagnetic energy with wavelengths roughly in the range 

400 nm < A < 650 nm. 

This wavelength range corresponds to the frequency range 
4.6 ■ 10^^ Hz < / < 7.5 ■ 10^^ Hz. 

We often think of electromagnetic radiation as behaving like a wave. How¬ 
ever, it has both wave-like and particle-like behavior. 

One way to understand fight is to think of it as composed of particles 
called photons. A quantum is a small chunk, and a photon is a quantum. 
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small chunk, of light. A related quantity is a phonon, which is a quanta, 
or small chunk, of lattice vibrations. We will discuss phonons in a later 
section, and they do not relate to light. Although, phonons can perturb 
light, and that is the basis for acousto-optic devices. The second way to 
understand light is to think of it as a wave with a wavelength A measured 
in nm. White light has a broad bandwidth while the light produced by a 
laser has a very narrow bandwidth. 

These two descriptions of light complement each other. A photon is the 
smallest unit of light, and it has a particular wavelength. The energy of a 
photon of light with wavelength A is given by 

E = hf^ j. (6.1) 

The quantity h is called the Planck constant, and it has a tiny value, 
h = 6.626 ■ J ■ s. The quantity c is the speed of light in free space, 
c = 2.998- 10® 

In SI units, energy is measured in joules. However, other units are some¬ 
times used by optical engineers because the energy of an individual photon 
is tiny compared to a joule. Another unit that is used is the electronvolt, 
or eV. The magnitude of the charge of an electron is g = 1.602 ■ 10“^® C. 
The electronvolt is the energy acquired by a charge of this magnitude in 
the presence of a voltage difference of one volt [68, p. 8]. Energy in joules 
and energy in eV are related by a factor of q. 


-®[ J] = 1 ■ -®[eV] 


( 6 . 2 ) 


Equations 6.1 and 6.2 can be combined to relate the energy of a photon in 
eV and the corresponding wavelength in nm. 


1240 

A[nm] 


(6.3) 


Sometimes, energy is specified in the unit of wave number, cm“\ which 
represents the reciprocal of the wavelength of the corresponding photon. 
Energy in joules and energy in wave number are related by 



6.626 ■ 10-34 


^1J|’ 

J ■ s ■ 


he 

~~ T 

2.998 ■ lO® f 
A [cm] 


100 ^ 


(6.4) 

( 6 . 6 ) 


i?[j] 1.986-10 


( 6 . 6 ) 
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The human eye can sense light from approximately A = 400 nm to 
A = 650 nm. Using the expressions above, we can calculate in different units 
the energy range over which the human eye can respond. An individual 
red photon with A = 650 nm has energy 

Ered = 3.056 ■ 10“^^ J = 1.908 eV = 1.538 ■ 10^ cm”^ (6.7) 

in the different units. Similarly, an individual blue photon with A = 400 
nm has energy 

Euue = 4.966 ■ 10“^^ J = 3.100 eV = 2.500 ■ 10^ cm’^ (6.8) 

We can calculate the energy of individual photons of electromagnetic 
radiation at radio frequencies, at microwave frequencies, or in other fre¬ 
quency ranges too. For example, the radio station WEAX broadcasts with 
a frequency / = 88 MHz. This corresponds to a wavelength of A = 3.407 m. 
An individual photon at this frequency has energy 

E = 5.831 ■ 10-2® J = 3.640 ■ 10“^ eV. (6.9) 

As another example, wi-fi operates at frequencies near / = 2.4 GHz which 
corresponds to the wavelength A = 0.125 m. Each photon at this frequency 
has energy 

E = 1.590 ■ 10-2^ J = 9.927 ■ 10“® eV. (6.10) 

Ultraviolet light has a wavelength slightly shorter than blue light. A photon 
of ultraviolet light with wavelength A = 350 nm, which corresponds to 
frequency / = 8.57 ■ 10^^ Hz, has energy 

E = 5.676 ■ 10“^^ J = 3.543 eV. (6.11) 

X-rays operate at wavelengths near A = 10“^® m. An x-ray photon with 
wavelength A = 10“^® m has energy 

E = 1.986 ■ 10“^® J = 1.240 ■ 10^ eV. (6.12) 

Why do we talk about radio waves but not radio particles while we treat 
light as both wave-like and particle-like? A person is around 1.5 to 2 m 
tall. The wavelength of the radio station broadcast in the example above 
was \rf ~ 3.4 m while the wavelength of blue light was \uue light ~ 400 
nm. Both radio frequency and optical signals are electromagnetic radiation. 
Both are well described by Maxwell’s equations. Both have wave-like and 
particle-like properties. Humans typically talk about the wave-like nature 
of radio waves because they are on a scale we can measure with a meter 
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stick. However, with the correct tools, we can observe both the wave-like 
and particle-like behavior of light. 

Why is UV light more dangerous than visible light? Why are x-rays so 
dangerous? Each photon of x-ray radiation has around a thousand times 
more energy than a photon of green light. This type of radiation is called 
ionizing radiation because each photon has enough energy to rip an elec¬ 
tron from skin or muscles. UV radiation also has enough energy per photon 
to rip an electron off while red light and blue light do not have enough en¬ 
ergy. Photons of radio frequency and microwave electromagnetic radiation 
contain nowhere near enough energy per photon to do this damage. These 
types of radiation can still pose a safety hazard if enough photons land on 
your skin. Microwave ovens are used to cook food. However, they do not 
pose the hazards of ionizing radiation. 

6.3 Semiconductors and Energy Level Diagrams 

6.3.1 Semiconductor Definitions 

Some semiconductors are made up of atoms of a single type like pure Si 
or pure Ge. Others contain a combination of elements in column 13 and 
column 15 of the periodic table. Semiconductors of this type include AlAs, 
AlSb, GaAs, and InP. Other semiconductors contain a combination of el¬ 
ements in columns 12 and 16 of the periodic table. Examples of this type 
include ZnTe, GdSe, and ZnS [9]. Most semiconductors involve elements 
located somewhere near silicon on the periodic table, but more complicated 
compositions and structures are also possible. Materials made from three 
different elements of the periodic table are called ternary compounds, and 
materials made from four elements are called quaternary compounds. 

To understand the operation of devices like solar cells, photodetectors, 
and LEDs, we need to study the flow of charges in semiconductors. Electri¬ 
cal properties in semiconductors are determined by the flow of both valence 
electrons and holes. Valenee eleetrons, as opposed to inner shell electrons, 
are the electrons most easily ripped off an atom. A hole is an absence 
of an electron. Valence electrons and holes are known as eharge earri- 
ers because they are charged and they move through the semiconductor 
when an external voltage is applied. At a finite temperature, electrons are 
continuously in motion, and some electron-hole pairs may form an exei- 
ton. These electron-hole pairs naturally combine, also called decay, within 
a short time. However, at any time, some charge carriers are present in 
semiconductors at temperatures above absolute zero due to the motion of 
charges. 
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Crystalline semiconductors can be classified as intrinsic or extrinsic [9, 
p. 65]. An intrinsic semiconductor crystal is a crystal with no lattice 
defects or impurities. At absolute zero, T = 0 K, an intrinsic semiconductor 
has no free electrons or holes. All valence electrons are involved in chemical 
bonds, and there are no holes. At finite temperature, some charge carriers 
are present due to the motion of electrons at finite temperature. The 
concentration of these charge carriers is measured in units electrons holes 


electrons holes 


cm^ 


or 


cm» 


nP ’ m^ ’ 

The intrinsic carrier concentration is the density of 


electrons in a pure semiconductor, and it is a function of the temperature T. 
At higher temperatures, more charge carriers will be present even if there 
are no impurities or defects in the crystalline semiconductor due to more 
motion of charges. If we apply a voltage across an intrinsic semiconductor 
at T = 0 K, no charges how. When the equilibrium concentration of 
electrons n or holes p is different from the intrinsic carrier concentration 
Hi then we say that the semiconductor is extrinsic. If either impurities or 
crystal defects are present, the material will be extrinsic. If a voltage is 
applied across an extrinsic semiconductor at T = 0 K, charges will how. If 
a voltage is applied across either an extrinsic or intrinsic semiconductor at 
temperatures above absolute zero, charge carriers will be present and will 
how. 


The process of introducing more electrons or holes into a semiconductor 
is called doping. A semiconductor with an excess of electrons compared 
to an intrinsic semiconductor is called n-type. A semiconductor with an 
excess of holes is called p-type. Silicon typically has four valence electrons 
which are involved in bonding. Phosphorous has five valence electrons, and 
aluminum has three. When a phosphorous atom replaces a silicon atom in 
a silicon crystal, it is called a donor because it donates an electron. When 
an aluminum atom replaces a silicon atom, it is called an acceptor. Column 
15 elements are donors to silicon and column 13 elements are acceptors. If 
silicon is an impurity in AlP, it may act as a donor or acceptor. If it replaces 
an aluminum atom, it acts as a donor. If it replaces a phosphorous atom, 
it acts as an acceptor. 

How can we dope a piece of silicon? More specifically, how can we dope 
a semiconductor with boron? Boron is sold at some hardware stores. It is 
sometimes used as an ingredient in soap. Start with a silicon wafer, and 
remove any oxide which has formed on the surface. Each silicon atoms 
forms bonds with four nearest neighbors. At the surface though, there is 
no fourth neighbor, so silicon atoms bond with oxygen from the air. Smear 
some boron onto the wafer, or place a chunk of boron on top of the wafer. 
Place it in a furnace at slightly less than silicon’s melting temperature. 
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around 1000 °C. Some boron will diffuse in and replace silicon atoms. 
Remove the excess boron. The same procedure can be used to dope with 
other donors or acceptors. What is the most dangerous part of the process? 
Etching the oxide off the silicon because hydrofluoric acid HE, a dangerous 
acid, is used [69]. 

Sometimes it is possible to grow one layer of a semiconductor mate¬ 
rial on top of a layer of a different type of material. A stack of different 
semiconductors on top of each other is called a heterostructure. Not all 
materials can be made into heterostructures. GaAs and AlAs have almost 
the same atomic spacings, so heterostructures of these materials can be 
formed. The spacing between atoms, also called lattice constant, in AlAs is 
0.546 nm, and the spacing between atoms in GaAs is 0.545 nm [9]. If the 
atomic spacing in the two materials is too different, mechanical strain in 
the resulting material will pull it apart. Even moderate mechanical strain 
can negatively impact optical properties of a device because defects may 
be introduced at the interface between the materials. These defects can 
introduce additional energy levels which can trap charge carriers. 

6.3.2 Energy Levels in Isolated Atoms and in Semiconductors 

In a solar cell, light shining on a semiconductor causes electrons to flow 
which allows the device to convert light to electricity. How much energy 
does it take to cause an electron in a semiconductor to flow? To answer 
this question, we will look at energy levels of: 

• An isolated A1 atom at T = 0 K 

• An isolated P atom at T = 0 K 

• Isolated A1 atom and P atoms at T > 0 K 

• An AlP crystal at T = 0 K 

• An AlP crystal at T > 0 K 

Aluminum has an electron configuration of ls^2s^2p®3s^3p^. It has 13 total 
electrons, and it has 3 valence electrons. More speciflcally, it has two va¬ 
lence electrons in the 3s subshell and one in the 3p subshell. Phosphorous 
has an electron configuration of ls^2s^2p®3s^3p^, so it has 5 valence elec¬ 
trons. Ideas in this section apply to materials regardless of whether they 
are crystalline, amorphous, or polycrystalline. 
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Energy Levels of Electrons of Isolated A1 and Isolated P Atoms 
at T = 0 K 

To understand the interaction of light and a semiconductor, start by con¬ 
sidering an isolated A1 atom and an isolated P atom at absolute zero, 
T = 0 K. How much energy does it take to rip off electrons of Al? It takes 
significantly less energy to rip off a valence electron than an electron from 
an inner shell. In fact, when we say an electron is a valence electron, or 
an electron is in a valence shell, we mean that the electron is in the shell 
for which it takes the least energy to rip off an electron. We do not mean 
that the electron is further from the nucleus, although often it is. When 
we say an electron is in an inner shell, we mean the electron is in a shell 
for which it takes more energy to rip off an electron. This text focuses on 
energy conversion devices which operate at moderate energies, so all of the 
devices discussed involve interactions of only valence electrons. Inner shell 
electrons will not be involved. It is also possible to excite, but not rip off, 
an electron. When an electron is excited, its internal momentum changes 
and its quantum numbers change. The terms valence electron and quantum 
number were both defined in Sec. 1.5.2. Less energy is required to excite 
than rip off an electron. The energy required to excite or rip off electrons 
can be supplied by thermal energy, an external voltage, an external optical 
field, or other external sources. 

Figure 6.1 is a plot of the energy required to excite or remove electrons 
from an isolated neutral Al atom at T = 0 K. The figure was plotted 
using data from [70]. While energy levels are drawn using actual data, 
the thickness of the lines is not drawn to scale. Energy is on the vertical 
axis. Allowed energy levels are shown by horizontal lines. Each electron 
can only have energy corresponding to one of these discrete possible energy 
levels. At T = 0 K, electrons occupy the lowest possible energy levels. One 
electron can occupy each line, so the lowest 13 energy levels are occupied by 
electrons. While not shown due to the resolution of the figure, the density 
of allowed energy levels increases as energy approaches zero at the top of 
the figure. Since we are considering the case of absolute zero temperature, 
these upper energy levels are not occupied by electrons. 

The left side of Eig. 6.2 replots the allowed energy levels of the electrons 
in an isolated Al atom at T = 0 K. The energy levels are also labeled. The 
right side of the figure plots the allowed energy levels of electrons in an 
isolated P atom also at T = 0 K. Data on phosphorous energy levels also 
comes from [70]. As with the Al atom, the electrons of the P atom can 
only occupy certain specific discrete energy levels. Since the atoms are at 
absolute zero, the electrons occupy the lowest energy levels possible. Eigure 
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6.3 contains the same information, but is zoomed in vertically to show the 
valence electron levels more clearly. 

The P atom has two more electrons than the A1 atom. Phosphorous 
atoms have more protons, so the electrons are a bit more tightly bound to 
the nucleus. For this reason, it takes a bit more energy to rip the electrons 
off, and the allowed energy levels are a bit different than for Al. 

The amount of energy required to rip a 3p electron off the atom is 
the vertical distance from the 3p level to the ground line at the top of the 
figure. The amount of energy required to rip a 2p electron off is the vertical 
distance from the 2p level to the ground line. As expected, these figures 
show that it requires more energy to rip off the inner shell 2p electron than 
the valence shell 3p electron. If enough energy is supplied, an electron will 
be ripped off, and the electron will flow freely through the material. If 
some energy is supplied but not enough to rip off the electron, the electron 
can get excited to a higher energy level. The energy required to excite an 
electron is given by the vertical distance in the figure from an occupied to 
an unoccupied energy level. In either case, we say that an electron-hole pair 
forms. If the amount of energy supplied is too small to excite an electron 
from a filled to unfilled state, the external energy will not be absorbed. 


Energy Levels of Electrons of Isolated Al and Isolated P Atoms 
at T > 0 K 

How do the energy levels change when the Al and P atoms are at tem¬ 
peratures above absolute zero, where electrons are continuously vibrating 
and moving? First, the energy levels broaden. The electrons can still only 
take certain energy levels, but there is a wider range to the allowed en¬ 
ergy levels. Second, occasionally, electrons spontaneously get excited into 
higher states. For example, a 3p electron may get excited into the 4s state 
temporarily. If it does, it will quickly return to the ground state. 

Energy Levels of AlP at T = 0 K 

How much energy does it take to rip an electron off an AlP crystal at 
T = 0 K? The three valence electrons of each Al atom and the five valence 
electrons of each P atom form chemical bonds. The energy required to rip 
off these electrons is slightly different than the energy required to rip off 
the equivalent electrons of isolated Al and isolated P atoms. Figure 6.4 
illustrates the energy levels of the valence electrons of AlP. Unlike in the 
previous figures, these energy levels do not come from actual data. Instead, 
they are meant as a rough illustration of the effect. The amount of energy 



Energy in eV 



re 6.3: Zoomed in version of the energy level diagram of 
im and ohosohorous atoms at T = 0 K olotted usine: dat 







112 


6.3 Semiconductors and Energy Level Diagrams 


0 


100 


200 


300 

> 

0 ) 

c 

>^ 

o 

c 

LU 

400 


500 


600 


700 


Figure 6.4: Energy level diagram at T = 0 K of an isolated aluminum 
atom, AlP crystal, and isolated phosphorous atom. Energy levels for the 
isolated atoms are from [70]. Energy levels for AlP are a rough illustration 
and not from actual data. 
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Figure 6.5: Energy level diagram of a semiconductor zoomed in to show 
only the conduction and valence band. 


required to rip off an electron is represented on the energy level diagram 
by the vertical distance from that level to the ground level at the top of 
the diagram. The energies needed to remove inner shell electrons do not 
significantly change from the energy levels of isolated atoms. 

Energy levels due to electrons shared amongst atoms in a solid semi¬ 
conductor are called energy bands. The filled energy level closest to the 
top of an energy level diagram for a semiconductor is called the valence 
band. The energy level above it is called the conduction band. The energy 
gap Eg, also called the bandgap, is the energy difference from the top of 
the valence band to the bottom of the conduction band. The term valence 
electron refers to an outer shell electron while the term valence band refers 
to a possible energy level it may occupy. At T = 0 K, the valence band is 
typically filled, and the conduction band may be empty or partially empty. 
We often are only interested in the valence and conduction bands because 
we are interested in energy conversion processes involving small amounts 
of energy. Eor this reason, we often plot energy level diagrams zoomed in 
vertically to just show these two energy levels as shown in Eig. 6.5. 

If the AlP crystal has defects or impurities, the energy levels broaden 
a bit because the electrical potential (in volts) seen by each A1 and each P 
atom is slightly different from the potential seen by other A1 and P atoms 
in the crystal. Thus, it takes slightly different amounts of energy to rip off 
each electron. Eor this reason, energy levels in amorphous materials are 
quite a bit broader than energy levels in crystals of the same composition 
[10]. If the AlP crystal has defects or impurities, additional allowed energy 
levels may be present. Some of these energy levels may even fall within the 
energy gap. 


Energy Levels of AlP at T > 0 K 

As with isolated atoms, there are two differences between energy levels for 
crystals such as AlP at T > 0 K compared to at T = 0 K. Eirst, energy 
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levels broaden. Second, some electrons get excited to higher energy levels 
and quickly, perhaps in a few microseconds, decay back down. 

6.3.3 Definitions of Conductors, Dielectrics, and Semiconduc¬ 
tors 

Conductors, dielectrics, and semiconductors were defined in section 1.5.1. 
Now that we have seen example energy level diagrams, we should revisit 
these definitions as well as define the term semimetal. In the presence of an 
applied external voltage, electric field, optical field, or other energy source, 
valence electrons fiow easily in a conductor [10, p. 429] [11, ch. 4]. In a 
conductor, the conduction band is partially filled with electrons, so there 
are many available energy states for electrons remaining in the conduction 
band. With just a little bit of external energy, possibly even from vibrations 
that naturally occur at T > 0 K, valence electrons fiow easily. Inner shell 
electrons can be ripped off their atoms and fiow, but significantly more 
energy is needed to rip off inner shell than valence electrons. 

In the presence of an applied external voltage, electric field, optical field, 
or other energy source, electrons do not fiow easily in an insulator [10, p. 
429] [11, ch. 4]. The valence band is filled and the conduction band is 
empty. The energy gap between valence band and conduction band in an 
insulator is typically above 3 eV. A little heat or energy from vibrations is 
not enough to excite an electron from one allowed energy state to another. 
If a large enough external source of energy is applied, though, an electron 
can be excited or ripped off of an insulator. 

In Sec. 3.3, electro-optic materials were discussed. Some insulators 
are electro-optic which means that in the presence of an external electric 
or optical field, the spatial distribution of electrons changes slightly which 
cause a material polarization to build up. Photons of the external electric 
or optical field in this case do not have enough energy to excite electrons 
in the insulator, so the internal momentum of electrons in the material 
does not change. The electro-optic effect occurs in insulators and involves 
external energies too small to excite electrons from one allowed energy state 
to another while the affects discussed in Sec. 6.3 involve semiconductors 
and external energies large enough to excite electrons from one energy level 
to another. 

At T = 0 K in a semiconductor, the valence band is full, and the 
conduction band is empty. The energy gap of a semiconductor is small, in 
the range 0.5 eV ^ Eg <3 eV. In the presence of a small applied voltage, 
electric field, or optical field, a semiconductor acts as an insulator. In the 
presence of a large applied voltage or other energy source, a semiconductor 
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acts as a conductor, and electrons flow. Photodiodes and solar cells are 
made from semiconductors. If enough energy is supplied to a photodiode, 
for example from an optical beam, the valence electrons will flow. More 
speciflcally, the photons of the external optical beam must have more energy 
than the energy gap of the semiconductor for the valence electrons to flow. 

The term semimetal is used to describe conductors with low electron 
concentration. Similar to conductors, in a semimetal at T = 0 K, there 
is no energy gap because the conduction band is partially fllled with elec¬ 
trons, and there are plenty of available energy states. The concentration of 
electrons for semimetals, however, is in the range n < 10^^ while 

n is greater for conductors [26, p. 304]. 

6.3.4 Why Are Solar Cells and Photodetectors Made from Semi¬ 
conductors? 

Energy level diagrams for AlP were illustrated above. The energy gap of 
AlP is Eg = 2.45 eV, so it is a semiconductor [9] [10, p. 432,543]. If a beam 
of light with photons of energy E < 2.45 eV is applied to a piece of AlP, the 
photons will not be absorbed, and no electrons will be excited. If a beam 
of light with photons of energy E > 2.45 eV is applied to a piece of AlP, 
some of those photons may be absorbed. When a photon is absorbed, an 
electron will be excited from the valence band to the conduction band. A 
blue photon with energy E = 3.1 eV will be absorbed by AlP, for example, 
but a red photon with energy E = 1.9 eV will not. When the electron is 
excited, the internal momentum of the electron necessarily changes. The 
excited electron quickly spontaneously decays back to its lowest energy 
state, and it may emit a photon or a phonon in the process. If a beam 
of light with photons of significantly higher energy is applied to a piece of 
AlP, it is possible to rip off electrons entirely from their atom. 

Why are solar cells and optical photodetectors made from semicon¬ 
ductors instead of insulators? Sunlight is composed of light at multiple 
wavelengths, and it is most intense at wavelengths that correspond to yel¬ 
low and green light. Green photons have energies near E ^ 2.2 eV, and 
visible photons have energies in the range 1.9 eV < E < 3.1 eV. Solar 
cells are made from materials with an energy gap less than the energy of 
most of the photons from sunlight. Semiconductors are used because the 
energy of each photon is large enough to excite the electrons in the mate¬ 
rial. Insulators are not used because most of the photons of visible light do 
not have enough energy to excite electrons in the material. The material 
should not have an energy gap that is too large otherwise photons will not 
be absorbed. 
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Material 

Gap in 
eV 

GaP 

2.26 

GaAs 

1.43 

GaSb 

0.70 


Material 

Gap in 
eV 

AlP 

2.45 

GaP 

2.26 

InP 

1.35 


Material 

Gap in 
eV 

ZnS 

3.6 

ZnSe 

2.7 

ZnTe 

2.25 


Table 6.1: Energy gap of various semiconductors. 


Why are solar cells and optical photodetectors made from semiconduc¬ 
tors instead of conductors? When light shines on a solar cell or photodetec¬ 
tor, photons of light are absorbed by the material. If the photon absorbed 
has energy greater than the energy gap of the material, the electron quickly 
decays to the top of the conduction band. With some more time, it de¬ 
cays back to the lowest energy state. In a solar cell or photodetector, a pn 
junction is used to cause the electrons to flow before decaying back to the 
ground state. The amount of energy converted to electricity per excited 
electron depends on the energy gap of the material, not the energy of the 
incoming photon. Only energy Eg per photon absorbed is converted to 
electricity regardless of the original energy of the photon. Thus, the energy 
gap of the material used to make a solar cell or photodetector should be 
large so that as much energy per excited electron is converted to electricity 
as possible. The material should not have an energy gap that is too small 
otherwise very little of the energy will be converted to electricity. The elec¬ 
tron and hole will release the excess energy, hf — Eg, quickly in the form 
of heat or lattice vibrations called phonons. 

Each semiconductor has a different energy gap Eg. Many solar cells and 
photodetectors are made from silicon, which is a semiconductor with Eg = 
1.1 eV. Predicting the energy gap of a material is quite difficult. However, 
all else equal, if an element of a semiconductor is replaced with one below 
it in the periodic table, the energy gap tends to get smaller. This trend is 
illustrated in Table 6.1. Data for the table comes from [9]. This trend is 
also illustrated in Eig. 6.6, which plots the energy gap and lattice constant 
for various semiconductors. Eigure 6.6 is taken from reference [71]. The 
horizontal axis represents the interatomic spacing in units of angstroms, 
where one angstrom equals 10^° meters. The vertical axis represents the 
energy gap in eV. This figure illustrates energy gaps and lattice constants 
for materials of a wide range of compositions. Eor example, the energy 
gap for aluminum phosphide can be found from the point labeled AlP, and 
the energy gap of aluminum arsenide can be found from the point labeled 
AlAs. Energy gap for semiconductors of composition AlAsj,Pi_a, can be 
found from the line between these points. 
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Figure 6.6: Energy gap versus interatomic spacing for multiple semicon¬ 
ductors. Used with permission from [71]. 


Some solar cells are made from layered material with the largest energy 
gap material on the top. For example, a solar cell could be made from a top 
layer of ZnS, a middle layer of ZnSe, and a bottom layer of ZnTe. Photons 
with energy E > 3.6 eV would be absorbed in the ZnS layer. Photons 
with energy 2.7 eY< E <3.6 eV would be absorbed by the ZnSe layer, and 
photons with energy 2.25 eY< E <2.7 eV would be absorbed by the ZnTe 
layer. Each photon of energy absorbed by the ZnS layer and converted 
to electricity would have more energy than each photon absorbed by the 
ZnSe layer. Solar cells made from layers in this way can be more efficient at 
converting energy from optical energy to electricity than equivalent solar 
cells made of a single material. 

The photo in Fig. 6.7 shows naturally occurring zinc sulfide, also called 
sphalerite, collected near Sheffier’s Rock shop near Alexandria, Missouri. 
The dark mineral embedded in the middle of the rock is the sphalerite. 

6.3.5 Electron Energy Distribution 

The Fermi energy level of a semiconductor, denoted Ef, represents the 
energy level at which the probability of finding an electron is one half [9] 
[10, p. 432,543]. The Fermi level depends on temperature, and it depends 
on the impurities in the semiconductor. Chemists sometime call the Fermi 
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Figure 6.7: The dark mineral embedded in the rock is naturally occurring 
zinc sulfide. 

level by the name chemical potential, fichem- 

In a pure semiconductor at T = 0 K, all electrons occupy the lowest 
possible states. The valence band is completely filled, and the conduction 
band is completely empty. The Fermi level, Ef, is the energy level at the 
middle of the energy gap. No electrons are found at energy Ef because no 
electrons can have an energy inside the energy gap. However, the Fermi 
level is a useful measure to describe the material. 

In a pure semiconductor at T > 0 K, some electrons are excited into 
higher energy levels. As the temperature increases, more electrons are likely 
to be found at higher energy levels more often. The probability that an 
electron is in energy level E varies with temperature as jgj 

The quantity ks is the Boltzmann constant. 



(6,13) 


The Fermi level for a material with T > 0 K is slightly higher than the 
Fermi level for a material with T = 0 K because more electrons are likely 
to be excited. 

The probability of finding an electron at energy level E at temperature 


T is 



(6.14) 


Equation 6.14 is called the Fermi Dirac distribution, and like any probabil¬ 
ity, it ranges 0 < F < 1. For energy levels far above the conduction band, 
{E — Ef) is large and positive, so electrons are quite unlikely to be found, 
F 0. For energy levels far below the valence band, (F — Ef) is large and 
negative, so electrons are quite likely to be found, F 1. 
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The concentration and type of impurities influence the energy of the 
Fermi level. A p-type material has a lack of electrons. For this reason in 
a p-type material, Ef is closer to the valence band than the middle of the 
energy gap. An n-type material has an excess of electrons. For this reason 
in a n-type material, Ef is closer to the conduction band. 


6.4 Crystallography Revisited 

6.4.1 Real Space and Reciprocal Space 

Physicists and chemists are often interested in where electrons or nucleons 
of atoms are likely to be found with respect to position in real space. Ideas 
of a lattice, basis, and crystal structure were discussed in Sec. 2.3.2. To 
review, a lattice describes the arrangements of points. The basis describes 
how atoms are arranged at each lattice point. The lattice and basis together 
form the crystal structure. A 3D lattice is described by three lattice vectors 
Oi, at, and at- If they are chosen as short as possible, they are called 
primitive lattice vectors. The magnitude of a primitive lattice vector may 
be around 0.1 nm. The primitive lattice vectors define a cell called a 
primitive cell. Since a lattice is periodic, if we know how to describe one 
primitive cell, we can describe the entire lattice. 

For each lattice, there is a corresponding reciprocal lattice defined by 
a set of vectors. Both contain the same information in different forms. 
For a 3D lattice with primitive vectors at, ^d at, ^e vectors of the 
reciprocal lattice are labeled by the vectors hi, 62 , and 63 . 

271 at X at 
bi = ^^^ 
ai ■ 02 X 03 



271 at X at 
02 = ^^ 

Oi ■ 02 X 03 


(6.16) 


27r at X at 

03 = ^^ 

Oi ■ 02 X 03 


(6.17) 


Notice that hi is perpendicular to at and at. Also, hi is parallel to at. 
More speciflcally, |6i | ■ |at| = 27r. (Factors of 27r show up due to choice of 

units, vs ^^.) Thus if vector at is long, ^ will be short. Just as 

we can get from one lattice point to another by traveling integer multiples 
of the at lattice vectors, we can get from any one poin^to the next of 
the reciprocal lattice by traveling integer multiples of the lattice vector. 
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Lattice vectors in real space have units of length, m. Lattice vectors in 
reciprocal space have units m“^. 

The reciprocal lattice gives information about the spatial frequency of 
atom s. If the planes of atoms in a crystal are closely spaced in one direction, 

I Oil is relatively small. The corresponding reciprocal vector |6i | is relatively 
large. The reciprocal lattice represents the spatial frequency o f th e atom 
in units m“^. If the planes of atoms in a crystal are far apart, |ai| is large 
and I &i I is small. 

If a beam of light shines on a crystal where the wavelength of light 
is close to the crystal spacing, light will be diffracted, and the diffraction 
pattern is related to the reciprocal lattice. The Brillouin zone is a primitive 
cell for a reciprocal lattice. The volume of a unit cell in reciprocal space 
over a unit cell in real space is given by 


vol. Brillouin zone 
vol. primitive cell in real space 


hi ■ 62 


X 


of ■ (I2 X 


= (2ir)“ 


(6,18) 


As for the real space lattice, to understand the reciprocal space lattice, 
we need to only understand one cell because the reciprocal space lattice is 
periodic. 


6.4,2 E versus k Diagrams 

The energy level diagrams, discussed in Section 6.3, plot allowed energies 
of electrons where the vertical axis represented energy. No variation is 
shown on the horizontal axis. The most useful energy level diagrams for 
semiconductors are zoomed in so that only the valence and conduction 
band are shown. In many cases, it is useful to plot energy level diagrams 
versus position in real space. For such a diagram the vertical axis represents 
energy, and the horizontal axis represents position. It is also useful to plot 
energy level diagrams versus position in reciprocal space. 

Kinetic energy is given by 

Ekinetic = (6-19) 

where represents velocity in ™ and m represents mass in kg. Momen¬ 
tum is given by M = mlf in units Electrons in crystals at 

T > 0 K vibrate, and certain vibrations are resonant in the crystal. The 
crystal momentum M crystal represents the internal momentum of due to 
vibrations. It can be expressed as 

^ crystal = ht ( 6 . 20 ) 
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Direct Semiconductor Indirect Semiconductor 

E E 




Figure 6.8: Energy plotted vs. \ k\ for a direct and indirect semiconductor. 


and has units of momentum The quantity k is called the wave 

vector, and it has units m“^. It represents change in spatial frequency, a 
distance in reciprocal space. The constant 


h = 


h 


( 6 , 21 ) 


is called h-bar and is the Planck constant divided by 2n. Kinetic energy 
can be written in terms of the wave vector. 


E 


kinetic — 


h^\k |2 
2m 


( 6 . 22 ) 


^nation 6.22 describes how energy of an electron varies with wave vector 
I k I which incorporates information about lattice vibrations. The energy 
is quadratic in wave vector, so plots of energy versus | k \ are parabolic. 
Equation 6.22 is just a model, and it applies best near the top of the 
valence band and b^tom of the conduction band. 

Energy versus | k \ diagrams plot allowed energy levels. Think of the 
I k I axis as change in position in reciprocal space. If the top of the valence 
band and bottom of the conduction band occur at the same | k \ value 
in a semiconductor, we say that it is direct. If the top of valence band 
and bottom of conduction band occur at different | k \ values, we say that 
the semiconductor is indirect. The left part of Eig. 6.8 shows an energy 
versus | k \ diagram for a direct semiconductor, and the right part of Eig. 
6.8 shows one for an indirect semiconductor. GaAs, InP, and ZnTe are 
direct semiconductors. Si, Ge, AlAs, and GaP are indirect semiconductors. 
Along different crystal axes, the band structure changes somewhat. The 
horizontal axis of an energy versus | k \ diagram may be specified along a 
particular axis in reciprocal space. 
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Indirect Semiconductor 
E 



Figure 6.9: Two possible mechanisms of photon absorption in an indirect 
gap semiconductor. 


What happens when we shine light on a direct semiconductor? A pho¬ 
ton of sufficient energy can excite an electron from the valence band to the 
conduction band to create an electron-hole pair. What happens in an indi¬ 
rect semiconductor? Figure 6.9 illustrates two possibilities. As illustrated 
by the longer arrow, an electron can be excited directly from the valence to 
conduction band. However, this requires a photon of more energy than the 
vertical distance between the top of the valence band and the bottom of 
the conduction band [25, p. 200]. Alternatively, as illustrated by the other 
two arrows, excitation from the top of the valence band to the bottom of 
the conduction band may involve a photon and a phonon. Both energy and 
momentum must be conserved, so a change in crystal momentum is needed 
to excite an electron in this case. Solar cells and photodetectors may be 
made from either direct or indirect semiconductors. 


6.5 Pn Junctions 

Many devices, including photovoltaic devices, LEDs, photodiodes, semi¬ 
conductor lasers, and thermoelectric devices are essentially made from pn 
junctions. To understand photovoltaic devices and these other energy con¬ 
version devices, we need to understand pn junctions. Consider a semicon¬ 
ductor crystal composed of an n-type material (with excess electrons) on 
one side and a p-type material (lacking electrons, in other words, with an 
excess of holes) on the other side. The junction of the p-type and n-type 
materials is called a pn junction. Assume the junction is abrupt and is at 
thermal equilibrium. 

Some pn junctions are made from elemental semiconductors like Si, and 
other pn junctions are made from compound semiconductors like GaAs. 
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Figure 6.10: Energy level diagram of p-type and n-type semiconductors. 


Some pn junctions have the same material on both sides while other pn 
junctions have different materials on either side. For example, a pn junction 
can be made from an n-type layer of GaAs and a p-type layer of GaAs. It 
can also be made from an n-type layer of GaAs and a p-type layer of AlAs. 

What happens when we put a p-type material and an n-type material 
together to form a pn junction? Valence electrons and holes move. Nuclei 
and inner shell electrons do not. Some excess electrons from the n-type 
region go towards the p-type region. Some excess holes from the p-type 
region go towards the n-type region. These charge carriers diffuse, are 
swept away from, a region near the junction. This region near the junction 
which is lacking charge carriers is called the depletion layer [10, p. 564]. 
As shown in Fig. 6.10, the Fermi level Ef is near the valence band for 
p-type materials. P-type material lacks electrons, so the energy where it is 
equally likely to find an electron state occupied and unoccupied is closer 
to the valence band. For a similar reason, the Fermi level Ef is near the 
conduction band for n-type materials. Figure 6.11 shows the energy level 
diagram versus position for the pn junction, and Fermi levels of the two 
materials are lined up in this figure. 

Gonsider a junction where the n-type material is silicon doped with 
phosphorous atoms and the p-type material is silicon doped with aluminum 
atoms. The n-type side of the pn junction has an excess of positive charges 
because some phosphorous atoms replace Si atoms in the material. Phos¬ 
phorous atoms have one more proton than silicon atoms. They also have 
one more electron, but the valence electron is a charge carrier which dif¬ 
fuses away from the junction. Similarly, the p-type side of the junction has 
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an excess of negative charges because some aluminum atoms replace silicon 
atoms. Aluminum atoms have one less proton than Si atoms. They also 
have one less electron, but the hole is a charge carrier which also diffuses 
away from the junction. 

An electric field forms across the junction due to the net charge dis¬ 
tribution near the junction. Electric field intensity is the force per unit 
charge, and it has the units There is also necessarily a voltage drop 
across a pn junction in equilibrium, and this voltage is called the contact 
potential Vo in the units of volts. While the contact potential is a voltage, 
it cannot be measured by placing a voltmeter across a pn junction because 
additional junctions would be formed at each lead of the voltmeter with 
additional voltages introduced [9, p. 141]. 

Figure 6.11 illustrates the energy level diagram of a pn junction. The 
horizontal axis represents position, and the vertical axis represents energy. 
It is related to the figures in Section 6.3. However, Fig. 6.11 is zoomed in 
vertically, and it is plotted versus position near the junction. It also shows 
the relationship between the energy level diagram and the circuit symbol 
for a diode, and the depletion layer is labeled. The vertical distance qVo , 
also labeled in Fig. 6.11, represents the amount of energy required to move 
an electron across the junction [9, p. 141]. 

Figure 6.12 shows the energy level diagram for a forward biased pn 
junction. In a forward biased pn junction, current flows from the p-type to 
n-type side of the junction. More specifically, holes flow from the p-type 
to n-type region, and some of these holes neutralize excess charges in the 
depletion layer. The depletion layer becomes narrower. The electric field 
preventing the flow of charges gets smaller, and the voltage drop across the 
junction gets smaller. The energy q (W — 14) is labeled in Fig. 6.12 for a 
forward biased pn junction where the voltage 14 is the voltage supplied. 
This energy represents the energy needed to get charges to flow across the 
junction, and it is smaller than the corresponding energy in the case of the 
unbiased junction. Charges flow more easily in the case of a forward biased 
pn junction, and the diode acts as a wire. 

Figure 6.13 shows the energy level diagram for a reversed biased pn 
junction. For a reverse biased pn junction, the voltage across the junction 
Vo + 14 is larger than for an unbiased junction, and the energy needed for 
charges to flow q (14 -|- 14) is larger than for an unbiased junction. Reversed 
biased pn junctions act as open circuits, and charges do not flow due to 
this amount of energy required. 

A light emitting diode (LED) is a device that converts electricity to 
optical electromagnetic energy, and it is made from a semiconductor pn 
junction. In use, a forward bias is put across the LED as shown in Fig. 
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position 


Figure 6.11: Energy level diagram of an unbiased pn junction. 



position 


Figure 6.12: Energy level diagram of a forward biased pn junction. 
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Figure 6.13: Energy level diagram of a reversed biased pn junction. 


6 .12. Holes flow from the p-type to n-type region. Some of these holes 
combine with electrons in the depletion layer. In an LED, photons are 
emitted in this process. The energy of the emitted photon corresponds to 
the energy of the energy gap. Some LEDs have an additional intrinsic, 
undoped, layer at the junction, between the p-type and n-type layers to 
improve the efficiency of the device. 

A solar cell and an optical photodetector are also essentially pn junc¬ 
tions. Both of these devices convert optical electromagnetic energy to elec¬ 
tricity. When light shines on these devices, electron-hole pairs are created 
at the junction. Due to the charge distribution across the junction, many 
of the electrons and holes created are swept away from the junction before 
they can recombine [9]. This flow of charges is a current, so the optical 
electromagnetic energy is converted into electricity. When light shines on 
a photovoltaic device, a voltage can be measured across the junction, and 
this effect is called the photovoltaic effect [9, p. 212]. 

The vertical distance between the conduction band and the valence 
band on an energy level diagram is the energy gap Eg. The energy gap 
of the material used to make a solar cell or photodetector determines the 
properties of the device. Photons with energy greater than the energy gap 
have enough energy to form electron-hole pairs while photons with less 





6 PHOTOVOLTAICS 


127 


UV \Vi*W*A^MIccMd IR / Thermal (amilMd) IR i Mcrowavs 



0.2|im OS 1.0 5 10 20 lOoVni 0.1cm 1.0cm 1.0m 


Wavelanoih (not lo tcalt) 

Figure 6.14: Diagram of atmospheric windows- wavelengths at which elec¬ 
tromagnetic radiation will penetrate the Earth’s atmosphere. Chemical 
Notation (CO 2 , O3) indicates the gas responsible for blocking sunlight at 
a particular wavelength. This figure is used with permission [72]. 


energy cannot. 

If a temperature gradient is applied across a pn junction, charges fiow. 
When one side of the device is heated, charges move more rapidly and these 
energetic charges diffuse to the cooler side. This effect, called the Seebeck 
thermoelectric effect, is discussed in Chapter 8 . 

6.6 Solar Cells 

6.6.1 Solar Cell Efficiency 

Energy conversion devices are never 100% efficient. Efficiency is defined as 
the output power over the input power. Efficiency of a solar cell is often 
defined as the ratio of electrical power out to optical power in to the device. 

P 

^ electrical out //-. oo\ 

Veff = - (6.23) 

^optical in 

Not all sunlight reaches a solar cell because some of it is absorbed by 
the earth’s atmosphere. This atmospheric absorption is strongly depen¬ 
dent on wavelength. Eigure 6.14 is a plot of the transmissivity of the 
atmosphere as a function of wavelength. It plots the percent of fight which 
passes through the atmosphere without getting absorbed. Some gases in 
the atmosphere, such as water vapor and CO 2 , absorb a significant amount 
of energy at particular wavelengths. The figure indicates which gas is re¬ 
sponsible for atmospheric absorption at some particular wavelengths. Eor 
example, ozone O 3 absorbs ultraviolet light. Ozone in the atmosphere offers 
benefits because ultraviolet light can damage eyes and skin. The intensity 
of the optical power from the sun that is hits a solar cell varies from day 




128 


6.6 Solar Cells 


to day and location to location. In a bright sunny area, a solar cell may 
receive around 0.1 [73, p. 7]. 

Even if energy from the sunlight reaches a solar cell, the energy is not 
converted to electricity with perfect efficiency. There are multiple reasons 
for this inefficiency, and some of these reasons relate to the fact that not 
all light that hits a solar cell is absorbed. Light may heat up the solar 
cell instead of exciting electrons to create electron-hole pairs [74]. Alter¬ 
natively, light may be reflected off the solar cell surface [74]. Many solar 
cells have an antireflection coating to reduce reflections, but they are not 
eliminated. The surface of other solar cells are manufactured to be rough 
instead of smooth to reduce reflections. Furthermore, if a photon hits an 
electron that is already excited, the photon will not be absorbed. Addition¬ 
ally, solar cells have wires throughout the surface to capture the produced 
electricity. These wires are often thin and in a flnger-like configuration. 
Light that hits these wires does not reach the semiconductor portion of 
the solar cell and is not efficiently converted to electricity. To reduce this 
issue, wires of some solar cells are made from materials that are partially 
transparent conductors, such as indium tin oxide or tin oxide Sn 02 [74]. In¬ 
dium tin oxide is a transparent conductor with a moderately high electrical 
conductivity of a = 10 ® [ 75 ] 

Other reasons that solar cells are not perfectly efficient have to do with 
what happens after a photon excites an electron. An electron may be 
excited, but it may decay before it gets swept from the junction [74]. A 
photon may excite an electron to a level above the conduction band, but the 
electron may quickly decay to the top of the conduction band losing some 
energy to heat. Internal resistance in the bulk n-type or p-type regions 
may convert electricity to heat. There may also be internal resistance of 
wiring in the system. Also unmatched loads make solar cells less efficient 
than matched loads [74]. 

The voltage across and the current produced by an illuminated solar 
cell are both functions of temperature. Reference [76] demonstrates, both 
theoretically and experimentally, that efficiency of a solar cell decreases 
as temperature increases. A number of mechanisms occurring in a solar 
cell are dependent on temperature. First, as the temperature increases, 
the allowed energy levels broaden. For this reason, the energy gap Eg, 
which is proportional to the voltage produced by the solar cell, is smaller 
at higher temperatures. As temperature increases, this voltage produced 
by the solar cell decreases roughly linearly [76]. Second, the current due 
to recombination of electron-hole pairs at the junction is a function of 
temperature. At higher temperatures, more electron-hole pairs recombine 
at the junction, so the overall current produced by the solar cell is less. For 
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this reason, as temperature increases, the overall current produced by the 
solar cell decreases roughly exponentially [76]. This effect on the current is 
the main reason that solar cell efficiency depends on temperature. Other 
mechanisms are temperature dependent, but are less significant [76]. 

6.6.2 Solar Cell Technologies 

There are four major solar cell technologies being developed: crystalline, 
thin film, multijunction cells, and emerging photovoltaic technologies [77]. 
However, these categories are not distinct because some solar cells fit into 
multiple categories simultaneously. Figure 6.15, from [77], compares solar 
cells of these technologies. More specifically, it shows record efficiencies 
for each of these types of solar cells as well as the year the records were 
achieved. 

The first category is crystalline, and these cells may be made from sin¬ 
gle crystals or from polycrystalline material [78]. The first generation of 
solar cells was made with this technology. For a simple recipe for how to 
produce a crystalline solar cell, see [69]. Most solar cells produced today, 
around 80% of the market, are silicon cells in this category. Typical effi¬ 
ciency of a crystalline solar cell available today may be around 20% [78]. 
Polycrystalline solar cells are often cheaper and a bit less efficient than 
single crystalline cells. 

The second category is thin film. To make these solar cells, thin films 
of semiconductors are deposited on a substrate such as glass or steel. The 
substrate may be rigid or flexible. The solar cell itself may be made of 
layers of material only a few microns thick. Thin film solar cells may be 
cheaper than other types of solar cells [78]. Often they are less efficient 
than crystalline cells, but they have other advantages [78]. One material 
used to make thin film solar cells is amorphous silicon. Another material 
in use is CdTe, which has a energy gap 1.45 eV. Cadmium and tellurium 
are both toxic, but they may be easier to deposit in thin films than silicon. 

The third category is multijunction, also called compound, solar cells. 
These solar cells are made of a dozen or more layers of semiconductor 
stacked on top of each other [78]. These layers form multiple pn junc¬ 
tions. Larger gap semiconductors are on the upper layers, and smaller gap 
semiconductors are closer to the substrate. These solar cells can be quite 
efficient. Cells with efficiency up to 46% have been demonstrated in labs 
|77|. 

The last category is emerging technology solar cells. Multiple creative 
strategies are being used to develop solar cells. Nanotechnology strategies 
include using solar cells made from carbon nanotubes and from quantum 
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dot based materials [78]. Organic solar cells also fall into this category. 
The active part of these solar cells is a thin, often 100-200 nm, layer of 
an organic material [79]. One advantage of organic solar cells is that their 
processing may not require as high of temperatures as the processing of 
solar cells made from pn junctions of inorganic semiconductors [79]. 

6.6.3 Solar Cell Systems 

Solar cells are used in a wide range of devices. Inexpensive lawn ornaments 
with solar cells are available at hardware stores for less than a dollar. Small 
photovoltaic devices used as optical sensors are equally inexpensive. On 
the other extreme, solar cells power the NASA Mars rovers Spirit and 
Opportunity as well as satellites orbiting the earth. Also, large arrays of 
solar cells are used to generate electricity. 

A typical solar cell produces around a watt of electrical power while 
a typical house may require around 4 kW of power [73]. To produce the 
necessary power, individual solar cells are connected together into modules, 
and the modules are connected together into solar panels. In a typical in¬ 
stallation on the roof of a house, a panel may be composed of around 40 
solar cells, and 10 or 20 panels may be mounted roof [73]. A typical solar 
panel installation on the roof of a building has a number of components 
in addition to the solar panel arrays. The additional components are of¬ 
ten referred to as the balance of the system, and they consist of batteries, 
mounting or tracking hardware, solar concentrators, and power condition¬ 
ers. These components are illustrated in Fig. 6.16. 

The mounting system is composed of the foundation, mechanical sup¬ 
ports, brackets, and wiring needed to physically mount and connect the 
solar panel. Some solar panels are mounted in a fixed position. Other 
solar panels are mounted on systems that angle the panels towards the 
sun. Some tracking systems rotate the panel around a single east-west 
axis. Others have two axes. Two axis tracking systems are often used with 
solar concentrators. A concentrator is a mirror or lens system designed to 
capture more of the sun’s light onto the panels. 

Solar panel systems require batteries or some other energy storage mech¬ 
anism to provide electrical power at night, on cloudy days, and other times 
when inadequate sunlight falls on the solar panels. Solar panels can last 
30 years or more with only about 1% or 2% degradation per year. Also, 
solar panels rarely need maintenance, and they cannot easily be repaired. 
If a solar panel fails, the entire panel is replaced. However, batteries have 
a typical lifetime of three to nine years, and they are often the first part of 
a solar panel system that needs replacement [73]. 
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Figure 6.15: Best efficiency of various types of solar cells. This plot is 
courtesy of the National Renewable Energy Laboratory, Golden, CO [77]. 
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Block Diagram of solar panel system 


Solar Panel Array 



Figure 6.16: Components of a solar panel system. 


The power conditioning system consists of an inverter which converts 
DC electricity to AC and, for grid tied systems, a system to match the phase 
of the produced AC power to the phase of the grid. Power conditioning 
systems also contain a system to limit the current or voltage to maximize 
the power delivered. Also, they include safeguards such as fuses to prevent 
injury or damage to equipment. The typical lifetime for the electronics 
may be around 10-15 years [73]. 

6.7 Photodetectors 

6.7.1 Types of Photodetectors 

Photodetectors are sensors used to convert light, at optical or other nearby 
frequencies, to electricity. One way to classify photodetectors is by their 
type of active material, which may be a solid or a gas. The first type of 
detectors are semiconductor photodetectors made from solid semiconduc¬ 
tor pn junctions. The choice of semiconductor infiuences the wavelengths 
of fight which can be absorbed because only photons with energy greater 
than or equal to the energy gap of the semiconductor can be absorbed. For 
example, silicon has an energy gap of 1.11 eV, so it is able to absorb the 
photons in both the visible range 1.9 eV < i? < 3.1 eV as well as photons in 
the near infrared range 1.1 eV < E < 1.9 eV. In some semiconductor pho¬ 
todetectors, a thin intrinsic (undoped) layer is added between the p-type 
material and the n-type material at the junction. In these semiconductor 
p-i-n junction photodetectors, the added layer widens the depletion layer. 
It also decreases the internal capacitance of the junction thereby increasing 
the detector response time [10, p. 660]. The second type of detectors are 
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made from gas filled vacuum tubes, and these detectors are called pho¬ 
totubes [10, p. 646]. A voltage is placed across electrodes in the tubes. 
When light shines on the phototube, energy from a photon of light can rip 
off an electron from a gas atom. The electron and ion how towards the 
electrodes, thereby producing electricity. The most common type of pho¬ 
totube is the photomultiplier tube. This device has multiple electrodes, and 
when an electron hits one of these electrodes, additional electrons are emit¬ 
ted. These electrons can hit additional electrodes to produce even more 
electrons. Because each incoming photon produces a cascade of electrons, 
photomultiplier tubes have high internal amplification. 

Another way to classify photodetectors depends on whether incoming 
photons have enough energy to rip off electrons or just excite them. The 
first type of detectors are called photoeleetrie deteetors, and they operate 
based on a process called photoeleetrie emission [10, p. 645] [27, p. 171]. 
In these detectors, incoming light has energy greater than or equal to the 
energy from the valence band to the ground level at the top of an energy 
level diagram. These detectors convert light to electricity because incoming 
photons of light rip electrons off their atoms, and the how of the resulting 
electrons is a current. The second type of detectors are called photoeondue- 
tive deteetors or sometimes photovoltaie deteetors, and they operate based 
on a process called photoeonduetivity [10, p. 647]. In these detectors, in¬ 
coming light has energy equal to the difference between the valence and 
conduction bands, not enough to rip off electrons. These detectors con¬ 
vert light to electricity because incoming photons excite electrons, and the 
conductivity of the detector is higher when light shines on it. Solid semi¬ 
conductor photodetectors can operate based on either photoelectric emis¬ 
sion or photoconductivity, but most operate based on photoconductivity. 
Phototubes typically operate based on photoelectric emission. 

Some photodetectors have a single element while others are made from 
an array of elements. A digital camera may contain millions of individual 
photodetectors. These elements are integrated with a eharge-eoupled deviee 
(CCD), which is circuitry to sequentially transfer the electrical output of 
each photodetector of the array [9, p. 359]. The CCD was invented in 1969 
by Willard S. Boyle and George E. Smith. For this invention, they shared 
the 2009 Physics Nobel Prize with Charles K. Kao, who was awarded the 
prize for his work on optical fibers [80]. 

Eyes in animals are photodetectors. The retina of the human eye is an 
array composed of around 120 million rod cells and 6 to 7 million cone cells 
[81]. These cells convert light to electrical impulses which are sent to the 
brain. 
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6.7.2 Measures of Photodetectors 

The frequency response is one of the most important measures of a photode¬ 
tector. Often it is plotted versus wavelength or photon energy instead of 
frequency. A photodetector is only sensitive within a particular wavelength 
range, and the frequency response is often not flat. 

As with all types of sensors, signal to noise ratio is another important 
measure. While photodetectors have many sources of noise, one major 
source is thermal noise due to the random motion of charges as they flow 
through a solid [9, p. 220]. To mitigate thermal noise in photodetectors 
used to detect very weak signals, the detectors are cooled with thermoelec¬ 
tric devices or using liquid nitrogen. A measure related to signal to noise 
ratio is the noise equivalent power. It is deflned as the optical power in 
watts that produces a signal to noise ratio of one [82]. 

Another measure of a photodetector is the detectivity^ denoted D*, in 

cmfHzi/2') , P , 

units-- - . It IS a measure of the strength of the output assuming a 

one watt optical input. By definition, it is equal to the square root of the 
area of the sensor times the bandwidth under consideration divided by the 
noise equivalent power [82] [83, p. 654]. 

^ VArea ■ Bandwidth 

Noise Equivalent Power 

Figure 6.17 shows detectivity versus wavelength for optical detectors made 
of various semiconductors. 

Photodetectors are also characterized by their response times. Response 
time is deflned as the time needed for a photodetector to respond to a step¬ 
like optical input [82]. Typical response times can range from picoseconds 
to milliseconds [83, p. 656]. There may be a tradeoff between response 
time and sensitivity, so some detectors are designed for fast operation while 
others are design for higher sensitivity [9, p. 220]. 
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Figure 6.17: Spectral response of a variety of photodetectors. This figure 
is used with permission from Hamamatsu [82]. 
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6.8 Problems 

6.1. Rank the materials from smallest energy gap to largest energy gap: 

• Indium arsenide, InAs 

• Aluminum arsenide, AlAs 

• Gallium arsenide, GaAs 

6.2. The energy level diagram for a silicon pn junction is shown in the 
figure below. Part of the device is doped with Ga atoms, and part of 
the device is doped with As atoms. Label the following: 

• The valence band 

• The conduction band 

• The energy gap 

• The n-type region 

• The p-type region 

• The depletion layer 

• The part of the device doped with Ga 

• The part of the device doped with As 



Position 




6 PHOTOVOLTAICS 


137 


6.3. The figure in the previous problem shows the energy level diagram 
for a semiconductor pn junction. 

(a) If this pn junction is used in an LED, what will be the wavelength 
in nm of the light emitted by the LED? 

(b) If this pn junction is used as a solar cell, what range of wave¬ 
lengths of light will be absorbed by the solar cell? 

6.4. A semiconductor is used to make an LED that emits red fight at 
A = 630 nm. 

(a) Eind the energy gap in eV of the semiconductor. 

(b) Eind the energy in joules of a photon emitted. 

(c) Eind the energy in joules for Avogadro constant number of these 
photons. 

6.5. The figure below shows the energy level diagram for a gallium arsenide 
LED. 

(a) Eind the energy gap. 

(b) Eind the energy of a photon emitted by the LED. 

(c) Eind the frequency in Hz of a photon emitted by the LED. 



Position 
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6 .6. Use Fig. 6.6 to answer this question. 

(a) Suppose you would like to make an LED that emits red light with 
a wavelength of 650 nm. Suggest three possible semiconductor 
materials that could be used. 

(b) Suppose you would like to make a layered solar cell using layers 
of the following materials: InP, Ino.sGao.sAs , and AlAso.sSbo.s, 
Which layer would be on top, in the middle, and on the bottom 
of the device, and why? 

6.7. Use Fig. 6.6 to answer this question. 

(a) Find the energy gap of InPo.iAso.g in the units of joules. 

(b) If InPo.iAso.g is used to make an LED, find the expected fre¬ 
quency, in Hz, of the photons emitted. 

(c) Would it be better to make a solar cell out of gallium phosphide 
or indium phosphide? Why? 

6 .8. A solar panel produces an average power of 800 W. The panel is in a 
location which receives an average of 0.07 of optical energy from 
the sun. Assume the panel has an efficiency of 9%. 

(a) Calculate the surface area of the solar panel in units m^. 

(b) Calculate the average amount of energy (in eV) produced in one 
week. 

6.9. A solar panel has an area of 50 m^, and it produces an average of 4 
kW of power. The panel is in a location which receives an average of 
0.085 of optical energy from the sun. Calculate the efficiency of 
the panel. 
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7 Lamps, LEDs, and Lasers 

7.1 Introduction 

Chapter 6 discussed devices that convert light to electricity. In this chapter, 
we discuss devices that convert electricity to light. These devices vary 
widely in size and shape from tiny Light Emitting Diodes (LEDs) and 
semiconductor lasers to large high power gas lasers. In addition to LEDs 
and lasers, lamps and optical amplifiers are also discussed. 

We take lamps for granted now because they are present in practically 
all buildings. However, their invention dramatically improved human pro¬ 
ductivity because lamps allowed people to constructively use indoor spaces 
at night. Similarly, lasers have improved productivity in many activities. 

We encounter them almost daily in our use of communications networks, 
DVD players, medical devices, and in other applications. 

7.2 Absorption, Spontaneous Emission, Stimulated Emis¬ 
sion 

Absorption, spontaneous emission, and stimulated emission are three re¬ 
lated energy conversion processes. Chapter 6 discussed devices based on 
absorption including solar cells and photodetectors. Devices which operate 
based on spontaneous emission include LEDs and lamps. Optical amplifiers 
and lasers operate based on stimulated emission. 

7.2.1 Absorption 

Absorption is the process in which optical energy is converted to internal 
energy of electrons, atoms, or molecules. When a photon is absorbed, the 
energy may cause an electron in an atom to go from a lower to a higher 
energy level, thereby changing the internal momentum of the electron and 
the electron’s internal quantum numbers. This process was illustrated in 
Chapter 6 by energy level diagrams. Energy in a solar cell or photodetector 
is then converted to electricity because the excited charge carriers can travel 
more freely through the material. The electrons absorbing the energy may 
be part of atoms which make up solids, liquids, gases, or plasmas. They 
may be around isolated neutral atoms, ionic compounds, or complicated 
organic molecules. Eurthermore the electrons absorbing the energy may be 
part of conductive, insulating, or semiconducting materials. The photons 
absorbed may be optical photons, with individual energies in the range 1.9 
to 3.1 eV that can be detected by human eyes. Alternatively, they may have 
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energies that are multiple orders of magnitude larger or smaller than the 
energy of a visible photon. Eor example, in isolated neutral neon atoms in 
the ground state, electrons occupy the 2p energy level but not the 3s energy 
level. These energy levels are separated by an energy gap of Eg = 1.96 eV 
which corresponds with energy of red photons of wavelength 632.8 nm [31]. 
If a photon of this energy impinges upon neon gas, the photon may be 
absorbed, and an electron of a neon atom would be excited to the higher 
energy level. Photons of smaller energy would not be absorbed. Photons 
of larger energy may be absorbed depending on allowed energy levels. As 
another example, the energy gap of the semiconductor gallium phosphide, 
GaP, is 2.2 eV which corresponds with the energy of a green photon of 
wavelength 549 nm. If a photon of this energy impinges on a piece of 
gallium phosphide, it may be absorbed. 

7.2.2 Spontaneous Emission 

Spontaneous emission is an energy conversion process in which an excited 
electron or molecule decays to an available lower energy level and in the pro¬ 
cess gives off a photon. This process occurs naturally and does not involve 
interaction of other photons. The average time for decay by spontaneous 
emission is called the spontaneous emission lifetime. Eor some excited en¬ 
ergy levels this spontaneous decay occurs on average within nanoseconds 
while in other materials it occurs within a few seconds [10, p. 480]. As 
with absorption, this process can occur in isolated atoms, ionic compounds, 
molecules, and other types of materials, and it can occur in solids, liquids, 
and gases. Energy is conserved when the electron decays to the lower level, 
and that energy must go somewhere. The energy may be converted to 
heat, mechanical vibrations, or electromagnetic photons. If it is converted 
to photons, the process is called spontaneous emission, and the energy of 
the photon produced is equal to the energy difference between the electron 
energy levels involved. The emitted photon may have any direction, phase, 
and electromagnetic polarization. 

There are many ways in which an electron can be excited to a higher 
energy level [10, p. 455]. Spontaneous emission processes may be classified 
based on the source of energy which excites the electrons, and these classes 
are listed in Table 7.1. If the initial source of energy for spontaneous emis¬ 
sion is supplied optically, the process is called photolumineseenee. Glow in 
the dark materials emit light by this process. If the initial form of energy 
is supplied by a chemical reaction, the process is called ehemilumineseenee. 
Glow sticks produce spontaneous emission by chemiluminescence. If the 
initial form of energy is supplied by a voltage, the process is called eleetro- 
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Spontaneous emission energy source 

Photoluminescence 

Optical electromagnetic waves 

Chemiluminescence 

Chemical reactions 

Electroluminescence 

Applied voltages 

Sonoluminescence 

Sound waves 

Bioluminescence 

Biological processes 


Table 7.1: Spontaneous emission is classified based on the source of energy 
[10, p. 455], 


luminescence. LEDs emit fight by electroluminescence. If the initial form 
of energy is caused by sound waves, the process is called sonoluminescence. 
If the initial form of energy is due to accelerated electrons hitting a target, 
this process is called cathodoluminescence. If spontaneous emission occurs 
in a living organism, such a firefiy, the process is called bioluminescence. 

At temperatures above absolute zero, some electrons in atoms are ther¬ 
mally excited to energy levels above the ground state. These electrons decay 
and emit a photon by spontaneous emission. Any object at a temperature 
above absolute zero naturally emits photons by spontaneous emission, and 
this process is called blackbody radiation. In 1900, Max Planck derived a 
formula for the energy density per unit bandwidth of a blackbody radiator 
by making the assumption that only discrete energies are allowed [10, p. 
453]. His work agreed with known experimental data, and it is one of the 
fundamental ideas of quantum mechanics. More specifically, the spectral 
energy density per unit bandwidth, u in units is given by 


SttP hf 

q3 \ 


(7.1) 


Equation 7.1 includes a number of constants including c the speed of light 
in free space, h the Planck constant, and ks the Boltzmann constant. Ad¬ 
ditionally, / is frequency in Hz, and T is temperature in kelvins. Eor a nice 
derivation, see [84, p. 186]. The first term represents the number of modes 
per unit frequency per unit volume while the second term represents the 
average energy per mode. The expression can be written as a function of 
wavelength instead of frequency with the substitution / = f. 

Photons emitted by a blackbody radiator have a relatively wide range of 
wavelengths, and this bandwidth depends on temperature. Eigure 7.1 plots 
the energy density per unit bandwidth for blackbody radiators as a function 
of wavelength at temperatures 3000, 4000, and 5000 K. Room temperature 
corresponds to around 300 K. Visible photons have wavelengths between 
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Wavelength ((jm) 

Eigure 7.1: Spectral energy density of a blackbody radiator. This figure is 
in the public domain [85]. 


400 nm < A < 650 nm. Erom the figure, we can see that black body 
radiators at higher temperatures emit both more photons and have a larger 
fraction of photons emitted fall in the visible range. 


7.2,3 Stimulated Emission 

Stimulated emission is the process in which an excited electron or molecule 
interacts with a photon, decays to an available lower energy level, and in 
the process gives off a photon. As with the other processes, this process 
can occur in isolated atoms, ionic compounds, organic molecules, and other 
types of materials, and it can occur in solids, liquids, and gases. If an in¬ 
coming photon, with energy equal to the difference between allowed energy 
levels, interacts with an electron in an excited state, stimulated emission 
can occur. The energy of the excited electron will be converted to the 
energy of a photon. The stimulated photon will have the same frequency, 
direction, phase, and electromagnetic polarization as the incoming photon 
which initiated the process [10, p. 436]. 
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Absorption Spontaneous Emission Stimulated Emission 



Eigure 7.2: Energy level diagrams illustrating absorption, spontaneous 
emission, and stimulated emission. 

7.2.4 Rate Equations and Einstein Coefficients 

The processes of absorption, spontaneous emission, and stimulated emis¬ 
sion are illustrated by energy level diagrams in Eig. 7.2. Energy is on the 
vertical axis, and nothing is plotted on the horizontal axis. Only two energy 
levels are shown, so this diagram illustrates only a small fraction of possi¬ 
ble energy levels of a material. The lower energy level is labeled 1. It may 
represent, for example, the highest occupied energy level of an electron in 
an isolated atom, or it may represent the valence band of a semiconductor. 
The higher energy level is labeled 2, and it may represent the lowest un¬ 
occupied energy level of an electron in an isolated atom or the conduction 
band of a semiconductor. The dot represents an electron occupying the 
energy level at the start of the process. The squiggly arrows represent a 
photon absorbed or emitted by the process. The vertical arrow shows how 
the internal energy of the electron changes in the process. During absorp¬ 
tion, an electron takes energy from an incoming photon, and the internal 
energy of the electron increases. During spontaneous emission, the inter¬ 
nal energy of an electron decreases, and a photon is emitted. Stimulated 
emission occurs when a photon, with energy equal to the energy gap of the 
levels, interacts with the electron. In the process, the electron decays to 
the lower energy level, and a photon is produced with the same frequency, 
direction, phase, and electromagnetic polarization as the original photon. 
The figures do not illustrate a change in position of the electrons. Instead, 
they illustrate a change in energy and internal momentum. 

The descriptions of the processes above involve changes in energy levels 
of an electron. However, absorption, spontaneous emission, and stimulated 
emission can instead involve vibrational energy states of molecules. Eor ex¬ 
ample, a photon may be absorbed by a molecule, and the energy may cause 
the molecule to go from one allowed vibrational state to another with higher 
internal energy. Similarly, this molecule may spontaneously decay from the 
higher energy state to a lower energy state emitting a photon by sponta- 
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neons emission or by stimulated emission. An example involving molecular 
vibration states is a carbon dioxide laser. This laser produces infrared light 
by stimulated emission at A = 10.6 /rm, and the stimulated emission oc¬ 
curs between allowed vibrational energy levels of the CO 2 molecule [31, p. 
217]. However, to simplify the discussion in this text, we will assume that 
electron energy levels are involved. This assumption is true in most, but 
not all, energy conversion devices. 

What factors determine the rate of these processes? Assume only two 
energy levels are involved. The number of electrons per unit volume in the 
lower state will be denoted ni, and the number of electrons per unit volume 
in the upper state will be denoted n 2 . The rate of absorption will be denoted 
, the rate of spontaneous emission will be denoted ,, and the 

rate of stimulated emission will be denoted Since only two energy 

levels are involved in this system, we can describe the rates of the processes 
either in terms of the upper or lower energy levels. Eor example, we can 
write the rate of absorption either as the change in population density with 
respect to time of the upper state or the change in population density with 
respect to time of the lower state. 


dn2 

dt 


abs 


drii 


dt 


abs 


(7.2) 


Absorption can only occur if there is an electron present in the lower 
energy level. Eurthermore, the rate of absorption is proportional to the 
number of electrons in the lower state. Additionally, the rate of absorption 
depends on the number of incoming photons. As in Eq. 7.1, u represent 
the spectral energy density per unit bandwidth in units We can model 
the rate of absorption in terms of these factors [84, ch. 6] [86, ch. 7]. 


dn2 

dt 


abs 


drii 



BuUiU 


(7.3) 


The constant of proportionality B 12 is called an Einstein B coefficient, and 

TTl 3 

it has units 

Spontaneous emission depends on the number of electrons in the upper 
energy level. We can model the rate of spontaneous emission as 


dn2 


dt 


spont 


dni 


dt 


spont 


— ^ 21^2 


(7.4) 


The constant of proportionality A 21 is called the Einstein A coefficient, and 
it has units ^ [84, ch. 6] [86, ch. 7]. No photons are needed to initiate 
spontaneous emission. 
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We can model the rate of stimulated emission as 


dn2 


dt 


stim 


drii 


dt 


stim 


-B2in2U. 


(7.6) 


The constant of proportionality B 21 is known as another Einstein B co¬ 
efficient, and it also has units [84, ch. 6] [86, ch. 7]. The rate of 
stimulated emission is dependent on the number of electrons in the upper 
energy level. Stimulated emission requires an incoming photon, so the rate 
also depends on the spectral energy density per unit bandwidth u. 

By considering the factors that affect the rate of absorption, sponta¬ 
neous emission, and stimulated emission, we can see some similarities and 
differences in the processes. As absorption occurs, the population of elec¬ 
trons in the upper energy level increases, and the population of the lower 
energy level decreases. As both spontaneous and stimulated emission oc¬ 
curs, the population of the upper energy level decreases, and the population 
of the lower energy level increases. Both the rate of absorption and the rate 
of stimulated emission depend on both the population of electrons in an 
energy level and the energy of incoming photons while the rate of sponta¬ 
neous emission does not depend on the energy of incoming photons. This 
similarity between absorption and stimulated emission is reflected in the 
rate equations, Eqs. 7.3 and 7.5. 

Einstein showed that if one of the coefficients describing the absorp¬ 
tion, spontaneous emission, or stimulated emission is known, the other 
coefficients can be calculated from it. We can combine the terms above to 
find the overall upper state population rate. 


dn2 

dt 


-A2in2 -k Bi2niu - B2in2U 


(7.6) 


At equilibrium, where photons are absorbed and emitted at the same rate, 
this population rate is zero. 


dn2 


dt 


equilibrium 


0 


-A2in2 -k Bi2niu - B2in2U 


We can solve for the energy density per unit bandwidth, u. 


(7.7) 


Buniu - - A2in2 


(7.8) 
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In the expression above, ^ represents the electron density in the lower 
energy state divided by the electron density in the upper state in equilib¬ 
rium. This quantity is a function of temperature. Assuming many allowed 
energy states, the number of occupied states decreases exponentially with 
temperature, an idea known as Boltzmann statistics. 


ni 



-hf 

kgT 


9i 


(7,10) 


The quantity ^ represents the degeneracy level which is the number of 
allowed electrons in the upper state over the number of allowed electrons 
in the lower state [84, p. 186]. In this expression, gi and g 2 are unitless 
measures of the number of ways electrons can occupy an energy states. 
Equations 7.10 and 7.11 can be combined. 


u = 


A 21 

- B21 


(7,11) 


u = 


giBi2 

92B21 


A 21 

B 21 

hf 

g 


1 


(7,12) 


Consider a blackbody radiator, a conducting wire which is continually 
supplied with heat so that it remains at temperature T in equilibrium. 


dn2 


dt 


equilibrium 


0 


(7.13) 


One expression for the energy density per unit bandwidth of this system is 
given by Eq. 7.1. Equation 7.12 gives a second expression for the energy 
density per unit bandwidth, and it was found by considering the relative 
rates of absorption, spontaneous emission, and stimulated emission. These 
equations can be combined to relate the rates of the different processes. 


STThf 1 

gS _ 1 ^ _ 

g2B2i 


(7.14) 


The above equation is true for the conditions 


A21 Mhp 

B2I ~ C 3 

(7.16) 

giBi 2 _ ^ 

(72 7 ^ 2 1 

(7.16) 


and 
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If we know one of the Einstein coefficients, we can quickly calculate the 
other two Einstein coefficients from Eqs. 7.15 and 7.16. 

These equations provide further insight into the operation of lasers and 
other devices based on stimulated emission. The overall nonequilibrium 
upper state population rate is given by 

= -A2in2 + B2i—niu - B2in2U (7-17) 

dt gi 

which can be simplified with some algebra. 

^ = -A2in2 - UB 21 (^2 - ( 7 - 18 ) 

The term in parenthesis is the net upper state population. Optical ampli¬ 
fication and lasing can only occur when the term in parenthesis is positive. 
The condition 

n2 - —ni > 0 (7.19) 

9i 

is called a population inversion [86, p. 189]. It only occurs when enough 
energy is being supplied to the system, by optical, electrical, or thermal 
means, so that there are more electrons in the upper energy level than the 
lower energy level. Population inversion has nothing to do with inversion 
symmetry discussed in Sec. 2.3.2. See Appendix C for a discussion of 
inversion and other overloaded terms. 

7.3 Devices Involving Spontaneous Emission 

Spontaneous emission occurs in many commercially available consumer 
products. This section discusses three categories of devices that convert 
electricity to light by spontaneous emission: incandescent lamps, gas dis¬ 
charge lamps, and LEDs. 

7.3.1 Incandescent Lamps 

An incandescent lamp is a device that converts electricity to fight by black- 
body radiation. These devices are typically constructed from a solid metal 
filament inside a glass walled vacuum tube. A current passes through the 
filament which heats it to a temperature of thousands of degrees. High 
temperatures are used because the visible spectral response of daylight is 
close to the visible spectral response of a blackbody radiator at a temper¬ 
ature of 6500 K [87]. The main limitation of incandescent lamps is their 
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efficiency. Much of the electromagnetic radiation emitted by a blackbody 
radiator falls outside the visible range. 

The main advantage of incandescent lamps over other technologies is 
their simplicity. For this reason, incandescent lamps were some of the 
earliest lamps developed. Humphry Davy demonstrated that blackbody 
radiation could be used to produce visible light in 1802, and practical in¬ 
candescent lamps date to the 1850s [88]. In order to develop these practical 
incandescent lamps, vacuum pumping technology had to be developed, and 
technology to purify the metal used to make lamp filaments was required 
| 88 |. 

In some ways, an incandescent lamp is similar to an antenna. In both 
cases, the input takes the form of electricity, and this electrical energy is 
converted to electromagnetic energy by passing through a conducting wire. 
In an antenna, the input is time varying to encode information, and the 
output is at radio or microwave frequencies. However, in an incandescent 
lamp, the input is typically AC and does not contain information. The 
desired output of an incandescent lamp is visible fight, but it also produces 
heat and electromagnetic radiation at infrared frequencies and at other 
non-visible frequencies. Additionally, antennas are typically designed to 
operate at a wavelength close to the length of the antenna, and such an¬ 
tennas can produce waves with specific electromagnetic polarization and 
radiation patterns. Spontaneous emission in incandescent lamps, however, 
is necessarily unpolarized and incoherent. 

7.3.2 Gas Discharge Lamps 

A gas discharge occurs when a conducting path forms through a plasma, 
an ionized gas [89]. Gas discharge devices convert electricity to fight by 
spontaneous emission when this type of conducting path forms. In 1802 
in addition to demonstrating blackbody radiation and proposing the idea 
of a fuel cell, Humphry Davy demonstrated a gas discharge device [3, p. 
222] [88]. W. Petrov demonstrated a gas discharge around the same time 
[88]. One of the first practical gas discharge lamps, a carbon arc lamp, was 
built by Leon Foucoult in 1850, and it was used for theater lighting [88]. 
Development of gas discharge lamps required the ability to purify gases in 
addition to the development of vacuum pumping technology [88]. Examples 
of gas discharge devices in use today include include sodium vapor lamps, 
mercury arc lamps, fluorescent lamps, and neon advertising signs [89]. 

A gas discharge lamp is made from a sealed tube containing two elec¬ 
trodes and filled by a gas. The glass tube contains the gas, maintains the 
gas pressure, and keeps away impurities. The pressure of the gas inside the 
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tube can range from 10“"^ Pa to 10^ Pa for different lamps [87, p. 206]. 
Typical electrode spacing is on the order of centimeters [87]. Some neon 
bulbs have an electrode spacing of 1 mm while many fluorescent tubes have 
an electrode spacing over 1 m. Hundreds to millions of volts are applied 
across the electrodes [89]. Transformers are used to achieve these high 
voltage levels. The voltage between the electrodes ionizes the gas inside 
the tube and provides a supply of free electrons which travel along the 
conducting path between the electrodes [89]. The gas may be ionized, and 
electrons supplied, by other methods such as chemical reactions, a static 
electric held, or an optical held instead [87, Ch. 5]. Electrons may also 
be supplied to the gas by thermionic emission, boiling electrons off the 
cathode. 

The optical properties of the lamp are determined by the gas inside 
the tube. Energy supplied by the electric held across the electrodes, or 
other means, excites electrons of the gas atoms to higher energy levels. 
Spontaneous emission occurs between distinct allowed energy levels only, 
so the emission occurs over relatively narrow wavelength ranges. Gases are 
chosen to have allowed energy level transitions in the desired wavelength 
range. Typical gases used include helium, neon, sodium, and mercury [87, 
p. 514]. 

Gas discharge lamps are classified as either glow discharge devices or arc 
discharge devices. Eigure 7.3 shows an example plot of the current between 
electrodes as a function of voltage. As shown in the figure, the current- 
voltage characteristic of a gas discharge tube is quite nonlinear. However, 
it can be broken up into three general regions, denoted the dark region, the 
glow region, and the arc region. The regions are distinguished by a change 
in slope of the current-voltage plot. This figure is used with permission 
from [89] which provides more details on the physics of gas discharges. 

The dark region of operation corresponds to low currents and voltages, 
and devices operating in this region are said to have a dark or Townsend 
discharge. Optical emission from devices operating in this region are not 
self sustaining. While atoms of the gas may ionize and collide with other 
atoms, no chain reaction of ionization occurs. The transition between the 
dark and glow discharges is called the spark [87, p. 160]. In Eig. 7.3, 
Vs is the sparking voltage. The second region, corresponding to higher 
currents, is called the glow region, and this region is called self sustaining 
because ions collide and ionize additional gas atoms producing more free 
electrons in an avalanche process. Significant spontaneous emission occurs 
in the glow discharge region [87] [89]. The third region, corresponding to 
even higher current, is called the arc region. Arc discharges are also self 
sustaining [87, p. 290], and spontaneous emission is produced. Once the 
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I 



Discharge Static V-I Characteristic 


Figure 7.3: Example current-voltage characteristics of a gas discharge 
lamp. Figure used with permission from [89]. 


arc discharge is established, relatively low voltages are required to maintain 
it compared to the voltages needed to maintain the glow discharge. 

Fluorescent lamps are a type of gas discharge device that involves the 
use of chemicals with desired optical properties, called phosphors [87, p. 
542]. The gas and electrode voltage used in fluorescent lamps is chosen 
to so that the spontaneous emission produced is at ultraviolet frequencies. 
These UV photons may be produced by either an arc or glow discharge. 
The UV photons produced are absorbed by the phosphor molecules, and the 
phosphor molecules emit light at lower frequencies. Examples of phosphors 
used include zinc silicate, calcium tungstate, and zinc sulflde [87, p. 542]. 

7.3.3 LEDs 

LEDs are devices that convert electricity to light by spontaneous emis¬ 
sion. They are made from pn junctions in semiconductors. Pn junctions 
were discussed in Section 6.5. When a forward bias is applied across a pn 
junction, electrons and holes are injected into the junction. The energy 
from the power supply excites electrons from the valence to the conduction 
bands. These excited electrons can flow through the material much more 
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easily than unexcited electrons. Some of the electrons and holes near the 
junction combine and spontaneously emit photons in the process. Some 
LEDs have a thin intrinsic, undoped, layer between the p-type and n-type 
layers at the junction to improve efficiency. 

LEDs emit light over a relatively narrow range of frequencies. The 
frequency of light emitted is determined by the energy gap of the semi¬ 
conductor. Semiconductors are used because the energy gap of semicon¬ 
ductors corresponds to the energy of near ultraviolet, visible, or infrared 
photons. While light emitted by an LED has a narrow range of frequencies, 
lasers emit light with a much narrower range of frequencies. LEDs emit 
light within a narrow frequency range, but applications, such as residential 
lighting, require white light with a broader bandwidth. One strategy used 
to produce white light from an LED is to use phosphors. In such a device, 
an LED converts electricity to near UV or blue light. The phosphors ab¬ 
sorb the blue light and emit light at lower energies, at wavelengths in the 
visible range. Eor this reason, blue LEDs were particularly important for 
generating white light. It took decades from the invention of red LEDs in 
the 1960s until reliable blue LEDs were developed in the 1980s and 1990s. 
In 2014, Isamu Akasaki, Hiroshi Amano and Shuji Nakamura were awarded 
the Nobel Prize in physics for their work developing blue LEDs. This effort 
required the development of deposition technology for new materials like 
gallium nitride, and it required being able to deposit these materials in 
very pure layers without mechanical strain tearing the materials apart [90]. 

A related device which emits light by spontaneous emission is an or¬ 
ganic light emitting diode, OLED. In an OLED, a voltage excites electrons 
in a thin layer, 100-200 nm, of an organic material, and the type of organic 
material used determines the wavelength of light emitted [91]. Some flat 
panel displays are made from arrays of OLEDs. White light in these dis¬ 
plays is achieved from a combination of red, green, and blue OLEDs near 
to each other [91]. 

LEDs are small devices that can often fit into a cubic millimeter. Eor 
this reason, they can be integrated into electronics more easily than devices 
like incandescent lamps and gas discharge lamps which require vacuum 
tubes. LEDs require low voltages electricity to operate. Since they require 
a small amount of input electrical power, they produce a small amount of 
output optical power. Incandescent lamps and gas discharge lamps have 
advantages in high power applications, but arrays of LEDs can also be used 
in these applications. Another advantage of LEDs is that they have a longer 
useful lifetime. In gas discharge lamps, the electrodes sputter, depositing 
material onto the surface of the tube, limiting the lifetime of the device. 
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7.4 Devices Involving Stimnlated Emission 

7.4.1 Introduction 

Lasers are devices that produce optical energy through stimulated emis¬ 
sion and involve optical feedback. The word laser is an acronym for Light 
Amplification by Stimulated Emission of Radiation. Lasers come in a wide 
range of sizes and shapes. Some lasers produce continuous output power, 
denoted cw for continuous wave, and other lasers operate pulsed. One ad¬ 
vantage of pulsed operation is that the peak intensity of the light produced 
can be extremely high even with moderate average input power. Some 
lasers are designed to operate at room temperature while other lasers re¬ 
quire external cooling. 

The development of many energy conversion devices required techno¬ 
logical breakthroughs. The development of lasers, however, was preceded 
by breakthroughs in understanding of energy conversion processes in atoms 
and molecules. The idea of amplification by stimulated emission was first 
developed in the mid 1950s, [31, p. 183] [83, p. 687]. A maser, which oper¬ 
ated at microwave frequencies, was demonstrated only a few years later by 
Gordon, Zeiger, and Townes in around 1955 [83, p. 687]. In 1960, a ruby 
laser with visible output at A = 694 nm was demonstrated by Maiman, 
[83, p. 687]. Lasing in semiconductors was predicted in 1961 [92] and 
demonstrated within a year in gallium arsenide [93]. The development of 
semiconductor lasers required both the theoretical prediction as well as de¬ 
velopment in the ability to deposit pure thin semiconductor layers. Thin 
crystalline layers grown on top of a substrate are called epitaxial layers. 
Early semiconductor lasers were made by growing epitaxial layers from a 
liquid melt, through a process called liquid phase epitaxy [94]. In sub¬ 
sequent years, other methods which allowed more control and precision 
were developed including molecular beam epitaxy [95] and metal organic 
chemical vapor deposition [96]. 
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Eigure 7.4: Components of a laser. 


7.4,2 Laser Components 

Lasers have three main components: a power supply also called a pump, 
an active material, and a cavity. These components are illustrated in Eig. 
7.4 where mirrors form the cavity. Input energy from the power supply 
excites electrons or molecules in the active material. A photon interacts 
with the excited electrons or molecules of the active material stimulating 
the emission of a photon at the same frequency, phase, direction, and elec¬ 
tromagnetic polarization. The cavity reflects the photon back to the active 
material so that it can stimulate another photon, and this process continues 
to occur as these photons stimulate additional identical photons. 


Pumps 

Laser power supplies are called pumps. Energy may be supplied to lasers 
in different ways. Eor many lasers, energy is supplied electrically. Eor 
example, the pump of a semiconductor laser is typically a battery which 
supplies a DC current. These lasers are energy conversion devices which 
convert the input electricity to light. Eor other lasers, energy is supplied 
optically, so the pump is a lamp or another laser. These lasers are energy 
conversion devices which convert light with large energy per photon to light 
with smaller energy per photon. The power supply of early ruby lasers were 
flashlamps [86, p. 351]. As another example, argon ion lasers are used to 
pump titanium doped sapphire lasers. Argon ion lasers can be tuned to 
emit photons with energy 2.54 eV (A = 488 nm). These photons excite 
electrons in titanium doped sapphire. Titanium doped sapphire lasers are 
tunable solid state lasers which emit near infrared light [86, p. 392]. 
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Active Materials 

Active materials can be solids, liquids, or gases, and lasers can be classified 
based on the state of matter of the active material. The active material of 
a laser has multiple allowed energy levels, and energy conversion occurs as 
the active material transits between energy levels. When an electron tran¬ 
sits between energy levels, its internal momentum changes, not its spatial 
position. Typically, the pump excites an electron from a lower to higher 
allowed energy level, and a photon is emitted when the electron goes from 
a higher to lower energy level. In some lasers such as carbon dioxide lasers, 
however, molecular vibration states are involved instead of electron energy 
states. 

Optical amplification and lasing can only occur when there is a popula¬ 
tion inversion in the active material. The term population inversion means 
that more electrons are in the upper energy level than the lower energy 
level. The condition for a population inversion was defined by Eq. 7.19. A 
photon begins the process of stimulated emission, and another photon is 
produced in the process. Only in the case of a population inversion can the 
resulting photon be more likely to stimulate another photon than decay by 
spontaneous emission, by emitting phonons, or by other means. 

In some lasers, called two level lasers, the pump excites an electron 
from a lower energy level to a higher energy level, and lasing occurs as 
the electron transits back and forth between the same two levels. In other 
lasers, more energy levels must be considered. Eigure 7.5 illustrates possible 
electron transitions in two, three, and four level lasers, but other three and 
four level schemes are possible too. In the three level system illustrated in 
the figure, the pump excites electrons from level one to level three. The 
electrons quickly decay to level two, possibly emitting heat, and lasing 
occurs as electrons transit from level two to level one. In the four level 
scheme illustrated, the pump excites electrons from level one to four. The 
electrons quickly decay from level four to three, emitting heat in the process. 
Lasing occurs between energy levels three and two. The electrons then 
decay between levels two and one, again emitting heat, vibration, or some 
other form of energy. Some four level systems lase more easily than two 
level systems because a population inversion may be easier to achieve in 
four than two level systems. Lasing requires a population inversion, and 
level two may be less likely to be occupied than level one. 
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Eigure 7.5: Example energy level diagram for two, three, and four level 
lasers. 


Cavities 

Laser cavities have two main functions. They confine photons to the active 
material and they act as optical filters. The simplest optical cavity is made 
from two mirrors as shown in Eig. 7.4. This type of cavity is called a Fabry 
Perot cavity. More complicated cavities have multiple mirrors, lenses, and 
other optical components to focus the desired photons within the active 
material and reject photons at frequencies other than the desired frequency. 
Semiconductor lasers do not use separate mirrors to form the cavity. In 
some semiconductor lasers, the edges of the semiconductors act as mirrors 
because the index of refraction of the semiconductor is larger than that 
of the surrounding air thereby refiecting a portion of the fight back inside 
the semiconductor. The edges of these lasers are formed by cleaving along 
crystal planes to produce extremely fiat surfaces. In other semiconductor 
lasers, multiple thin layers of material act as mirrors. 

Even without an active material present, an optical cavity acts as an 
optical filter that selectively passes or rejects fight of different wavelengths. 
To understand this idea, consider the rectangular cavity shown in Eig. 7.6. 
Assume that the cavity has partial mirrors on the left and right side so that 
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Figure 7.6: The solid arrow shows the longitudinal direction while the 
dotted arrows show the transverse directions. The solid sinusoid shows an 
allowed longitudinal mode. The cavity length in the longitudinal direction 
is equal to The dotted sinusoids show allowed transverse modes. The 
cavity lengths in the transverse directions are equal to 

some light can enter the cavity on the left side and some light can exit the 
cavity on the right. The direction along the length of the cavity, illustrated 
by the solid arrow, is called the longitudinal direction. The other two 
directions, illustrated by dotted arrows, are called transverse directions. If 
the longitudinal length of the cavity is exactly equal to an integer number 
of half wavelengths of the light, the wave will constructively interfere with 
itself. However, if the longitudinal length of the cavity is not equal to 
an integer number of half wavelengths, it will destructively interfere. The 
same ideas apply in the transverse directions. In the figure, the longitudinal 
length of the cavity is equal to three half wavelengths shown by the solid 
sinusoid. The transverse lengths are both equal to one half wavelength 
shown by the dotted sinusoids. Because of this constructive or destructive 
interference, cavities selectively allow certain wavelengths of light to pass 
through while they attenuate other wavelengths of light. In a typical laser 
cavity, the ratio of the longitudinal length to the transverse lengths is much 
larger than is shown in Fig. 7.6. Figure 7.6 illustrates a rectangular cavity 
while many lasers have cylindrical cavities instead. The same ideas apply, 
so only certain allowed longitudinal and transverse modes propagate in 
cylindrical cavities too [86, p. 133,145]. 

If there is a pump and an active material in a cavity, this filtering effect 
encourages lasing to occur at specific wavelengths due to the feedback the 
cavity provides. As discussed above, stimulated emission occurs when a 
photon interacts with an excited electron. The result is another photon of 
the same frequency, electromagnetic polarization, phase, and direction as 
the original photon. When the pump first turns on, electrons are excited, 
but no photons are present. Very soon, some photons are produced by 
spontaneous emission. Some of these photons stimulate the emission of ad¬ 
ditional photons. Since the cavity selectively attenuates some wavelengths 
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but not others, photons produced by stimulated emission are more likely 
to occur at certain wavelengths corresponding to modes in the longitudinal 
direction. Eor these modes, the cavity length is equal to an integer multiple 
of half wavelengths. Due to the feedback of the laser cavity, these photons 
go on to stimulate additional identical photons. Eor this reason, the output 
of a laser necessarily has a very narrow wavelength range. 


7.4.3 Laser Efficiency 

The overall efficiency of a laser is the ratio of the output optical power 
over the input power. Many lasers are electrically pumped, and the overall 
efficiency, also known as the wall plug efficiency, for these lasers is the ratio 
of the output optical power over the input electrical power [10, p. 604]. 


Veff 


Pr 


optical out 


P 


electrical in 


(7.20) 


The pump, active material, and cavity all affect a laser’s efficiency. The 
overall efficiency is the product of a component due to the pump rjpump, a 
component due to the active material rjguantum, and a component due to 
the cavity rjcavity [86]. 


Veff ^pump ' ^quantum ' ^cavity \ * 

These factors vary widely from one type of laser to another. 

In an optically pumped laser, a lamp or another laser excites the elec¬ 
trons of the active material. In this case, some of the pump light may 
get reflected from the surface or transmitted through instead of absorbed 
by the active material. Also, some of the pump energy may be converted 
directly to heat. Additionally, especially in the case of lamps which emit 
light over a wide range of frequencies, the pump light may have too little 
energy per photon to excite the electrons, or the light may have too much 
energy per photon thereby exciting electrons to a different upper energy 
level. Also, some of the pump light may interact with electrons that are 
already in excited energy states. In an electrically pumped laser, electricity 
excites the electrons of the active material. Some of the electrical energy 
may be converted to heat instead of exciting the electrons. All of these fac¬ 
tors involving the pump contribute to rjpump and the overall laser efficiency 
Veff- 

The contribution to the overall laser efficiency due to the active mate¬ 
rial rjquantum IS more commonly known as the internal quantum efficiency. 
Some fraction of excited electrons decay to a lower energy level and emit 
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a photon by spontaneous or stimulated emission. Alternatively, other ex¬ 
cited electrons decay to a lower energy level while emitting heat or lattice 
vibrations instead. The internal quantum efficiency is the ratio of the rate 
with which excited electrons decay and produce a photon over the rate at 
which all excited electrons decay [10, p. 562]. It depends on temperature, 
the concentration of impurities or crystalline defects, and other factors [10, 
p. 596]. 

Efficiency is also determined by the laser cavity. A laser cavity reflects 
photons towards the active material. However, the laser cavity must let 
some light exit. In many lasers, the cavity is formed by mirrors. While 
these mirrors reflect most of the light, some light is absorbed and some light 
is transmitted through the mirrors as laser output. Many lasers which use 
mirrors include lenses, prisms, and other optical components in the cavity 
to focus or Alter light to the active material. These components may also 
reflect or absorb some light and thereby decrease the laser efficiency. As 
mentioned above, the cavity of many semiconductor lasers is formed by 
the interface between the active material and the surrounding air. While 
external mirrors can reflect over 99% of photons [86, p. 159], mirrors formed 
by semiconductor air interfaces are much less efficient. The amount of light 
reflected depends on the index of refraction of the material. In gallium 
arsenide, for example, the index of refraction is 3.52 which corresponds to 
only 31% of light reflected at each interface [97]. 

The influence on efficiency of internal absorption and mirror reflectivity 
can be summarized in a single relationship [98]. 

In(^) 

Veff = Veff-other / \ (7.22) 

ai + ln(jj) 

In this equation, R is the unitless mirror reflectivity, a is the absorption 
coefficient of the active material in units m“\ and I is the length of the 
active material in m. The term rj^ff-other represents the efficiency due to 
all other factors besides absorption and mirror reflectivity, and r^e// is the 
overall efficiency. Equation 7.22 can be rewritten with some algebra. 


Veff 


Vef f—other 



1 + 


Affi I T 


(7.23) 


These efficiency concepts generalize to other energy conversion devices 
which produce light. Equation 7.20 also describes the overall efficiency of 
LEDs and lamps in addition to electrically pumped lasers. The concepts 
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of efficiency due to the pump and internal quantum efficiency also apply to 
LEDs and lamps. However, rjcavity is not useful in describing these devices 
because LEDs and lamps do not contain a cavity. 


7.4,4 Laser Bandwidth 


Compared to LEDs and gas discharge lamps, incandescent lamps emit light 
over a much wider range of wavelengths. Compared to these devices, lasers 
emit light over a much narrower range of wavelengths. One reason that 
lasers emit over such a narrow wavelength range is that photons generated 
by stimulated emission have the same wavelength as the stimulating pho¬ 
ton. As explained above, another reason is that only light at integer half 
multiples of the length of an optical cavity constructively interfere. 

This narrow bandwidth of lasers compared to other sources of light 
is a major advantage in many applications. Eor example, lasers generate 
communication signals sent down optical fibers. Multiple signals can si¬ 
multaneously be sent down a single fiber with each signal produced by a 
laser at a slightly different frequency. Due to the narrow bandwidth, these 
signals can be separately detected at the receiver. 

Bandwidth of devices which emit fight is typically specified by the full 
width half maximum bandwidth (EWHM). More specifically, intensity of 
light emitted is plotted as a function of wavelength where optical intensity 
is proportional to the square of the electric field. To find the EWHM, iden¬ 
tify the wavelength of maximum intensity, and identify the wavelengths 
corresponding to half this intensity. The wavelength difference between 
these points of half intensity is called the EWHM, and this quantity is 
specified in meters or more likely nanometers. Sometimes EWHM is spec¬ 
ified in units of Hz instead. A frequency response plot, showing intensity 
of light emitted versus frequency, is used to find EWHM in Hz. Again two 
points at half maximum intensity are identified on the plot. The frequency 
difference between these points of half intensity is the EWHM in Hz. A 
related measure is called the quality factor, and lasers with narrow band¬ 
width have high quality factor. It is defined as the ratio of the wavelength 
in nm of peak intensity emitted over the EWHM in nm. Alternatively, it 
is defined as the ratio of the frequency of peak intensity over the EWHM 
in Hz. 


Quality factor = 


'^peak intensity in nm _ ^peak intensity in Hz 


EWHMi 


nm 


EWHM 


[H2 


(7.24) 


As an example, consider Eigs. 7.7 and 7.8 which are from [99]. Eigure 
7.7 relates to a dye laser where the active material is a liquid solution of 
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Wavelength (nm) 

Figure 7.7: Optical intensity versus wavelength for a dye laser with an 
active material of rhodamine 6G mixed with silver nanoparticles. The 
curves correspond to two different pump energies, one above the lasing 
threshold and the other below the lasing threshold. This figure is used 
with permission from [99]. 



Figure 7.8: Optical intensity versus pump energy for dye lasers. Curve 
a describes a laser with rhodamine 6G as the dye while the other curves 
describe a laser with rhodamine 6G mixed with various nanoparticles. This 
figure is used with permission from [99]. 
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the organic dye rhodamine 6G mixed with silver nanoparticles. Curve a of 
Eig. 7.8 relates to a dye laser with rhodamine 6G as the active material. 
The other curves of Eig. 7.8 relate to dye lasers with active materials 
made from rhodamine 6G doped with various nanoparticles. Typically, 
lasing will only occur if the active material is pumped strongly enough. If 
less energy is supplied, spontaneous emission occurs. Above a threshold, 
some spontaneous emission still occurs, but stimulated emission dominates. 
Eigure 7.7 plots the intensity of light emitted at two different pumping 
levels, above and below the threshold for lasing. Erom this figure, we 
can see that the bandwidth of light emitted when the device is producing 
only spontaneous emission is much broader than the bandwidth of light 
emitted when the device is lasing. The EWHM and quality factor in each 
case can be approximated from this figure. Eor the spontaneous emission 
curve, ^ 45 nm and quality factor ^ 13. Eor the stimulated 

emission curve, EWHMjj^j^j 5 nm and quality factor ^ 115. While these 
values are for dye lasers, other types of lasers, especially gas lasers, can 
have EWHM values which are orders of magnitude smaller, and values of 
0.01 nm are achievable [83, p. 625]. Eigure 7.8 illustrates another feature 
typical of lasers. Gonsider curve a which shows the intensity of the output 
versus pump energy supplied. The arrow in the figure near 65 mJ indicates 
the lasing threshold. Once lasing occurs, the intensity of the fight emitted 
increases due to the optical feedback, so a discontinuity in the slope of plots 
of this type can be seen at the lasing threshold. 

7.4.5 Laser Types 

Engineers have developed many types of lasers utilizing a wide range of ac¬ 
tive materials. Lasers can be classified based on the type of active material 
as gas lasers, dye lasers, solid state lasers, or semiconductor lasers. Most 
lasers fit into one of these four categories, but there are exceptions such as 
free electron lasers where lasing occurs between energy levels of unbound 
electrons [31, p. 277] [86, p. 417]. 

Gas lasers 

In a gas laser, the active material is a gas, and lasing occurs between 
energy levels of a neutral or ionized atom. Gas lasers are constructed from 
a gas filled glass tube. Electrodes inside the tube supply power to excite 
electrons of the gas atoms, and external mirrors form the cavity. One of the 
more common gas lasers is the helium neon laser, which typically operates 
at 632.8 nm [31, ch. 10]. However, the laser cavity may be designed so 
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that lasing occurs at 3.39 /rm and at other wavelengths too [31, ch. 10]. 
Another example of a common gas laser is the argon ion laser in which 
lasing occurs between energy levels of ionized argon. One advantage of 
gas lasers compared to other types of lasers is that they can be electrically 
pumped. Another advantage is that gas lasers can be designed to have 
high output powers. For this reason, gas lasers are used in applications 
requiring high power such as cutting, welding, and weaponry [ 86 , p. 405]. 
Carbon dioxide lasers can produce hundreds of kilowatts of power when 
operating continuous wave and terawatts of power when operating pulsed 
[ 86 , p. 405]. However, gas lasers are often physically large in size and not 
as portable as semiconductor lasers. High power gas lasers typically also 
require water cooling or another form of cooling. 

Dye Lasers 

In dye lasers, the active material is a solute in a liquid, and dye lasers are 
often optically pumped by other lasers [ 86 , p. 386]. Lasing may occur 
between molecular vibration energy levels as opposed to electron energy 
levels [31, p. 225] [ 86 , p. 386]. An advantage of dye lasers is that they may 
be tunable over a wide range of wavelengths. However, dye lasers require 
regular maintenance because the dyes have a finite useful lifetime [ 86 , p. 
391]. One example of a dye used is the organic molecule rhodamine 6 G, 
and lasers using this dye are tunable from 570 < A < 610 nm [31, p. 228] 
[ 86 , p. 387]. Figures 7.7 and 7.8 illustrate the behavior of a dye laser of 
this type. 

Solid state lasers 

The active material of a solid state laser is a solid insulating material, often 
a high purity crystal, doped with some element. Lasing occurs between 
electron energy levels of the dopant embedded in the solid. External mirrors 
are used to form the cavity. Solid state lasers are typically optically pumped 
by lamps or other lasers. A ruby laser is a solid state laser with an active 
material made from a crystal of sapphire, AI 2 O 3 , doped with around 0.05% 
by weight of chromium Cr^^ ions [31, ch. 10 ]. Ruby lasers are three level 
lasers [10, p. 476]. Another common solid state laser is a neodymium 
yttrium aluminum garnet laser, often denoted Nd:YAG, which is a four 
level laser. The active material of this laser is yttrium aluminum garnet 
Y 3 AI 5 O 12 doped with around 1 % of neodymium Nd^’*' ions, and this laser 
produces infrared fight at A = 1.0641 /rm [10, p. 478] [31, p. 208] [ 86 , p. 
539]. Another common laser is the titanium doped sapphire laser, denoted 
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Ti:Sapph. The active material of this laser is sapphire AI 2 O 3 doped with 
about one percent of titanium ions This laser is tunable in the range 

700 < A < 1020 nm [ 86 , p. 392], Tuning is achieved through an adjustable 
prism inside the laser cavity and through coatings on the mirrors of the 
cavity. Due to the tunability, these lasers are used for spectroscopy and 
materials research. 


Semiconductor lasers 

The active material of a semiconductor laser is a solid semiconductor pn 
junctions. An intrinsic, undoped, layer may be added between the p-type 
layer and the n-type layer at the junction to increase the width of the de¬ 
pletion region and improve overall efficiency [ 10 , p. 567]. As with diodes 
and LEDs, the entire device typically fits inside a cubic millimeter. The 
wavelength emitted depends on the energy gap of the semiconductor. The 
first semiconductor lasers were made from gallium arsenide and produced 
infrared light [93]. Since then, semiconductor lasers emitting at all visible 
frequencies have been produced. It took over thirty years from the time the 
first semiconductors were produced to the time reliable blue lasers were pro¬ 
duced [90] [100]. The first blue semiconductor lasers were produced using 
ZnMgSSe, and more commonly now GaN is used. Developing this technol¬ 
ogy required the ability to deposit very pure layers of the semiconductors 
without developing mechanical strain in the layers. 

Almost all semiconductor lasers are made from direct semiconductors. 
It is for this reason that the first semiconductor lasers were made from GaAs 
even though silicon processing technology was more developed at the time 
[93]. Direct semiconductors were defined in Section 6.4 and illustrated in 
Eig. 6 . 8 . In a direct semiconductor, the top of the valence band and the 
bottom of the conduction band line up in a plot of energy levels versus 
wave vector \k\. 

Eigure 7.9 is a sketch of energy levels versus wave vector for a direct 
semiconductor and an indirect semiconductor. In both cases, an electron 
is excited to the conduction band. In both cases, the electron can decay 
by spontaneous emission from the conduction band to the valence band. 
In both cases, both energy and momentum must be conserved. In the 
direct semiconductor case, the electron can decay by emitting a photon. 
The electron does not need to change momentum in the process. While 
it is not shown in the figure, the electron can also decay by stimulated 
emission. In the indirect case, spontaneous emission can occur, but this 
process necessarily requires a change in momentum of the electron too. 
While it is possible that spontaneous emission can occur and produce a 
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Direct Semiconductor 
E 



Indirect Semiconductor 
E 



Figure 7.9: Energy level diagram vs. wave vector illustrating spontaneous 
emission in a direct and indirect semiconductor. 


photon, often the electron decays by producing heat or vibrations instead 
of a photon of light [86, p. 444]. For this reason, stimulated emission is 
significantly less likely to occur in indirect than direct semiconductors. 

As discussed above, semiconductor lasers do not have external mir¬ 
rors. Semiconductor lasers can be broadly classified into two categories, 
edge emitters and vertieal eavity surfaee emitting lasers (VCSELs) [101] 
depending on whether the optical emission is from the edge or the surface 
of the device. In edge emitting lasers, the cavity is often formed by the 
edges of the semiconductor. In other edge emitting lasers called distributed 
feedback semiconductor lasers, a grating, which acts as an optical filter, is 
etched into the semiconductor. In vertical cavity surface emitting lasers, 
multiple epitaxial layers of different materials form mirrors above and below 
the active material. 

A main advantage of semiconductor lasers over other types of lasers is 
their small size. They can be integrated into both consumer devices like 
laser pointers and DVD players as well as industrial equipment and com¬ 
munication networks. Another large advantage is that they are electrically 
pumped. They also often do not need external cooling due to their rela¬ 
tively high overall efficiency. Another advantage is that the output wave¬ 
length can be designed by selecting the composition. For example, semi¬ 
conductor lasers of composition Ini_a;Gaa;Asi_j,Pj, produce infrared fight in 
the range 1.1 /rm < A < 1.6 /rm. This frequency range is particularly use¬ 
ful for optical communication networks. Fiber optic cables are made from 
Si02 glass, a material with very low but nonzero absorption. Absorption 
is a function of wavelength, and the absorption minimum of silica glass is 
near 1.55 /rm [10, p. 882]. These fibers also have low, but nonzero disper¬ 
sion. Dispersion refers to the spread of pulses as they propagate through 
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the fiber. The dispersion minimum in silica glass is around 1.3 /rm. [10, 
p. 879]. Semiconductor lasers producing fight in this range can be used to 
transmit signals down optical fibers, and these signals will have very low 
absorption and dispersion. A limitation is power output. While a semi¬ 
conductor laser can produce over a watt of power, gas lasers can produce 
orders of magnitude more power. 

7.4,6 Optical Amplifiers 

Optical amplifiers are quite similar to lasers, and they can be made from all 
types of active materials used to make lasers including gases, solid state ma¬ 
terials, semiconductors, and dyes [10, p. 477]. An optical amplifier consists 
of a pump and active material, but it does not have a cavity. The pump 
excites electrons of the active material to an upper energy level. Photons 
of an incoming optical signal cause additional photons to be generated by 
stimulated emission. Amplification occurs because these incoming photons 
generate additional photons, but lasing does not occur without the optical 
feedback provided by the cavity. 

Erbium doped fiber amplifiers are one of the most useful types of optical 
amplifiers because of their use in optical communication networks [10, p. 
882]. These devices can amplify optical signals without the need to con¬ 
vert them to or from electrical signals. They are solid state devices where 
stimulated emission occurs between energy levels of erbium, a dopant, in 
silica glass fibers. Energy from a semiconductor laser acts as the pump 
which excites electrons of the erbium atoms. Erbium doped fiber ampli¬ 
fiers are very useful because they can amplify optical signals near the fiber 
absorption minimum at 1.55/im. 


7.5 Relationship Between Devices 
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Figure 7.10: Devices which convert between electricity and light can be 
classified based on whether they involve absorption, spontaneous emission, 
or stimulated emission. Thick borders indicate categories of devices while 
thin borders indicate example types of devices. 
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Eigure 7.11: Devices which convert between electricity and light can be 
classified based on the type of active material. Thick borders indicate 
categories of devices while thin borders indicate example types of devices. 
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Scientists have come up with a wide variety of devices that convert 
between electricity and light, and they have come up with a wide variety 
of applications for these devices. Covering all such devices and all their 
applications is beyond the scope of this text. However, this chapter has 
shown some of the variety of devices. One way to classify devices which 
convert between electricity and light is to group them into categories based 
on whether they primarily involve absorption, spontaneous emission, or 
stimulated emission. Figure 7.10 illustrates how to classify many of the 
devices discussed in this chapter in this way. Ovals indicate absorption, 
rounded rectangles indicate spontaneous emission, and rectangles indicate 
stimulated emission. In the figure, thick borders indicate categories of de¬ 
vices while thin borders indicate example types of devices. Dotted fines 
indicate devices with gaseous active materials, dashed fines indicate semi¬ 
conducting active materials, mixed dotted and dashed lines indicate solid 
state active materials, widely spaced dotted lines indicate conductive active 
materials, and solid lines indicate mixed or other active materials. 

This diagram is far from complete because many other categories of 
devices exist, and these categories may be broken into further subcategories. 
Furthermore, only a handful of example devices are shown relating to some 
of the specific devices discussed above. This diagram includes absorption 
based devices discussed in Chapter 6. It also includes antennas discussed in 
Chapter 4. Light is a form of electromagnetic radiation with frequencies in 
the visible range. Light can be absorbed by a solar cell and spontaneously 
emitted by an LED, for example. Similarly, electromagnetic waves at longer 
wavelengths can be absorbed or spontaneously emitted by antennas which 
are devices with conductive active materials. 

A different way of classifying devices which convert between electricity 
and light is to classify them based on the type of active material. All of 
these devices involve the interaction of light and atoms. The atoms involved 
may be part of a gas, may be dopants inside an insulating solid, may be 
part of a bulk semiconductor material, or may be part of a conductive 
solid. This way of classifying devices is illustrated in the Fig. 7.11. Dotted 
lines indicate devices with gaseous active materials, dashed lines indicate 
semiconducting active materials, mixed dotted and dashed lines indicate 
solid state active materials, widely spaced dotted lines indicate conductive 
active materials, and solid fines indicate mixed or other active materials. 
As in the previous figure, ovals indicate absorption, rounded rectangles 
indicate spontaneous emission, and rectangles indicate stimulated emission. 
Antennas are shown twice in the conductor based devices category because 
receiving antennas involve absorption while transmitting antennas involve 
spontaneous emission. Also as in the previous figure, thick borders indicate 



7 LAMPS, LEDS, AND LASERS 


169 


categories of devices while thin borders indicate example types of devices. 
Again, this figure does not show a complete list of all possible devices or 
device categories, but it does illustrate relationships between some devices 
discussed in this chapter. 

Devices are usually designed to involve only one of these processes of 
absorption, spontaneous emission, or stimulated emission. However, it is 
possible for multiple of these processes to occur in a single device depend¬ 
ing on how it is operated. Eor example, a semiconductor laser converts 
electricity to fight by stimulated emission when current above the lasing 
threshold is supplied. If a weaker current is supplied, the device will act 
as an LED which converts electricity to light by spontaneous emission. If 
light shines on the device and the voltage across the device is measured, 
the same device acts as a photodetector which converts fight to electricity 
by absorption. Similarly, photomultiplier tubes, gas discharge lamps, and 
gas lasers all involve tubes of gas with electrodes to supply or measure 
electricity. Like many energy conversion devices, these devices may con¬ 
vert electricity to fight when operated in one direction and convert light to 
electricity when operated in reverse. 

7.6 Problems 


7.1. Identify whether the devices below operate based on spontaneous 
emission, stimulated emission, or absorption. 

• Light emitting diode 

• Gas discharge lamp 

• Argon ion laser 

• Solar panel 

• Semiconductor laser 

7.2. Consider a blackbody radiator at a temperature of 6500 K. Use Mat- 
lab, or similar software, to answer this question. 

(a) Eind the frequency which corresponds to peak spectral energy 
density per unit bandwidth. 

(b) Eind the wavelength which corresponds to peak spectral energy 
density per unit bandwidth. 
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(c) Find the value of the spectral energy density per unit bandwidth 
in ^ at the frequency found in part a. 

7.3. A semiconductor laser which emits A = 500 nm light has a length of 
800 /rm. The width is 12 /rm, and the thickness is 5 /rm. How many 
wavelengths long is the device in the longitudinal direction? How 
many wavelengths long is the device in each transverse direction? 

7.4. Assume a semiconductor laser has a length of 800 fxm. Laser emission 
can occur when the cavity length is equal to an integer number of 
half wavelengths. What wavelengths in the range 650 nm < A < 
652 nm can this laser emit, and in each case, list the cavity length in 
wavelengths. 

7.5. Assume two energy levels of a gas laser are separated by 1.4 eV, and 
assume that they are equally degenerate {gi = g 2 ). The spontaneous 
emission Einstein coefficient for transitions between these energy lev¬ 
els is given by A 12 = 3 ■ 10® s“b Find the other two Einstein coeffi¬ 
cients, Bi 2 and B 21 . 

7.6. The energy gap of AlAs is 2.3 eV, and the energy gap of AlSb is 1.7 
eV [9, p. 19]. Energy gaps of materials of composition AlAsa;Sbi_a, 
with 0 < a; < 1 vary approximately linearly between these values 
[9, p. 19]. Suppose you would like to make a semiconductor laser 
from a material of composition AlAs^^Sbi-a,. Find the value of x that 
specifies the composition of a material which emits fight at wavelength 
A = 640 nm. 

7.7. Laser spectra are often modeled by Lorentzian functions. A Lorentzian 
function centered at the origin with area under the curve of unity has 
equation 

1 0.5 ■ FWHM 

^ ^ ~ TT ' + (0.5 ■ FWHM)^ 

where FWHM is the full width at half maximum. The maximum 
value of this function is pyyHM ' of f'ig- 

is centered near A = 570 nm, has a FWHM of 5 nm, and it has a 
maximum luminescence intensity of 49. Find a Lorentzian equation 
that can model this particular spectrum. 
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7.8. As discussed in the previous problem, laser spectra are often modeled 
by Lorentzian functions. To better understand Lorentzian functions, 
use Matlab or similar software for this problem. 

(a) Plot a Lorentzian function centered at the origin with EWHM 
5 and maximum amplitude of unity. On the same axis, plot a 
Gaussian function also centered at the origin with EWHM 5 and 
maximum amplitude of unity. 

(b) Repeat part a, but put the vertical axis of your plots on a log 
scale. 

7.9. The figure illustrates a laser spectrum. Approximately find: 

(a) The wavelength of peak intensity 

(b) The EWHM 

(c) The quality factor 
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7.10. Three main components of a laser are the pump, active material, and 
cavity. Four main types of lasers are gas lasers, semiconductor lasers, 
dye lasers, and solid state lasers. Match the example component with 
the best description of the type of component and type of laser it is 
found in specified. (Each answer will be used once.) 


Example Component 

1. Edges of a AlGaAs crystal 

2. Rhodamine 6G liquid solution 

3. External mirror made of Si02 glass coated with 
aluminum 

4, Battery of a laser pointer 

5. Si02 glass doped with 1% Er atoms 

6. GO 2 gas in an enclosed tube 

7. Pn junction made from InGaAs 

8. Argon ion laser used to supply energy to excite 
electrons of a Ti doped Sapphire 

Description 


A. Gavity of a semiconductor laser 

B. Gavity of a gas laser 

C. Active material of a semiconductor laser 

D. Active material of a gas laser 

E, Active material of a dye laser 

F, Active material of a solid state laser 

G. Pump of a semiconductor laser 

H. Pump of a solid state laser 


7.11. The intensity from sunlight on a bright sunny day is around 0.1 

Laser power can be confined to a very small spot size. Assume a 
laser produces a beam with spot size 1 mm^. For what laser power 
in watts will the intensity of the beam be equivalent to the intensity 
from sunlight on sunny day? Staring at the sun can damage an eye, 
so staring at a laser beam of this intensity is dangerous for the same 


reason. 
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8 Thermoelectrics 

8.1 Introduction 

A thermoelectric device is a device which converts a temperature differential 
to electricity, or vice versa, and it is made from a junction of two differ¬ 
ent conductors or semiconductors. To understand thermoelectric devices, 
we need to understand the fundamentals of heat transfer and thermody¬ 
namics. This chapter begins by discussing these fundamental ideas. Next, 
thermoelectric effects and thermoelectric devices are discussed. 

Many common processes heat an object. Rubbing blocks together, for 
example, heats them by friction. Burning a log converts the chemical energy 
in the wood to thermal energy, and applying a current to a resistor also 
heats it up. How can we cool an object? If we supply electricity to a 
thermoelectric device, one side heats up and the other cools down. We can 
place the object we want to cool near the cooler side of the thermoelectric 
device. 

Thermoelectric devices, pyroelectric devices, and thermionic devices all 
convert energy between a temperature difference and electricity. Pyroelec¬ 
tric devices were discussed in Sec. 3.2. They are made from an insulat¬ 
ing material instead of from a junction of conductors or semiconductors. 
Thermionic devices are discussed in Sec. 10.2, and they involve heating a 
cathode until electrons evaporate off. Thermoelectric devices, discussed 
in this chapter, are much more common than pyroelectric devices and 
thermionic devices due to their efficiency and durability. 

8.2 Thermodynamic Properties 

A container of air of fixed mass confined to a volume stores energy. We can 
shrink the volume of the air. This process requires energy, and the shrunken 
volume of air stores more energy. We can increase the gas pressure, for 
example, by exerting a force on a piston within which air is confined. This 
process requires energy, and the air under pressure stores more energy. We 
can take the fixed volume of air and heat it too. It takes energy to heat 
the air, and the hotter air stores more energy. Similarly, we can shake 
the container of air. Again, this process requires energy, and the energy 
from shaking is stored in the internal energy, the random motion, of the 
air molecules. 

To talk about thermodynamic energy conversion, we need to define four 
fundamental properties of a system: volume, pressure, temperature, and 
entropy. All of these properties depend on the current state, not the past 
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Units for Pressure 

1 

^ = 1 Pa 

1 

bar = 10® Pa 

1 

mmHg= 133.322 Pa 

1 

atm = 101, 325 Pa 

1 

psi = 6.894757 ■ 10® Pa 


Table 8.1: Pressure unit conversion factors [68]. 


history, of the sample. These properties can be classified as intensive or 
extensive [2, p. 10]. An intensive property is independent of the size or 
extent of the material. An extensive property depends on the size or extent 

[2, p. 10]. 

Volume V is an extensive property measured in m^ or liters where 1 L = 
0.001 m^. Pressure P is an intensive property measured in the SI units of 
pascals where 1 Pa=l Pressure is also measured in a wide variety 
of other, non-SI, units such as bars, millimeters of mercury, or standard 
atmospheres as fisted in Table 8.1. Pressure measures are often specified in 
comparison to the lowest possible pressure, of a complete vacuum, and such 
pressure measurements are called absolute pressure measurements [102, p. 
15-17]. In some cases, values are specified as the difference above the local 
atmospheric pressure, and these measurements are called gauge pressure 
measurements [102, p. 15-17]. In other cases, values are specified as the 
difference below the local atmospheric pressure, and these measurements 
are called vaeuum pressure measurements [102, p. 15-17]. Unless otherwise 
specified, the term pressure in this text refers to absolute pressure, not 
gauge or vacuum pressure. 


Symbol 

Quantity 

Unit 

Ext/int 

V 

Volume 


Extensive 

P 

Pressure 

Pa 

Intensive 

T 

Temperature 

K 

Intensive 

S 

Entropy 

J 

K 

Extensive 


Table 8.2: Thermodynamic properties. 
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Symbol 

Name 

Value and Unit 

ks 

Boltzmann constant 

1.381 ■ 10-23 ^ 

R 

Molar gas constant 

molK 

Na 

Avogadro constant 

6.022 ■ 1023 1 , 

mol 


Table 8.3: Values of the Boltzmann constant, the molar gas constant, and 
the Avogadro constant. 


Temperature T is an intensive property measured in either the SI units 
of degrees Celsius or kelvins. By definition, we can relate the two units by 




273.15 


( 8 . 1 ) 


[68]. We can also measure temperature in the non-SI unit of degrees Eahren- 
heit. Temperature in degrees Celsius and temperature in degrees Eahren- 


heit are related by 




( 8 . 2 ) 


As with absolute pressure measurements, temperature in kelvins is said to 
be measured on an absolute temperature scale because the lowest possible 
temperature is given by zero kelvin. All temperatures are either absolute 
zero or have positive values. We use the term temperature to describe 
a property of a system. We use the term heat transfer to describe the 
proeess of transferring energy from a hot to a cold object. Entropy S is 
measured in units and it is an extensive property. Intuitively, entropy 
is a measure of the lack of order or organization of a material. The atoms 
in an amorphous material are less ordered than the atoms in a crystal of 
the same composition, so the amorphous material has more entropy. 

Some further definitions will be needed. The symbol N represents the 
number of atoms or molecules of a substance. While it is not usually 
considered a fundamental thermodynamic property, it is a useful property 
of a sample. Sometimes it is specified in the units of mols instead of by the 
number of atoms or molecules. The Avogadro constant 

Na = 6.022 ■ 10^3 ^ (8.3) 

mol 

is a constant which is used to convert a number given to the number per 
mol. The molar gas constant is 

M = 8.314 —-—. 

mol ■ K 


(8.4) 
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Material 

Bulk 

modulus 

B in GPa 

Thermal 

cond. K 

■ W 

111 

m K 

Electrical cond. a 

in ^ 
o m 

Ref. 

Diamond 

539 

300 

1 ■ 10-^2 _ X . iq-2 

|104] 

Stainless steel 

143 

15.5 

1.3 ■ 10^ - 1.5 ■ 10® 

|105] 

Graphite 

18.6 

195 

o 

1 

o 

o 

-4 

[106] 

Silicone rubber 

1.75 

1.38 

1-10-^^-3.2-10-i2 

|107] 


Table 8.4: Bulk modulus, thermal conductivity, and electrical conductivity 
of some materials. The references list ranges of values for bulk modulus 
and thermal conductivity while this table lists their averages. 


The Boltzmann constant is 

ks = 1.381 ■ 10"^^ (8.5) 

These three constants are related by 

M. , . 

ks = -. ( 8 , 6 ) 

8.3 Bulk Modulus and Related Measures 

The bulk modulus B describes how a gas, liquid, or solid changes as it is 
compressed [103]. More specifically, bulk modulus per unit volume is the 
change in pressure required to get a given compression of volume, 

(9P 

B = -V^ (8.7) 

and bulk modulus is specified in the SI units of pascals or The bulk 
modulus is greater than zero (B > 0) even though there is a minus sign in 
Eq. 8.7 because volume shrinks when pressure is applied. Table 8.4 lists 
example bulk modulus values. 

Assuming constant temperature, the inverse of the bulk modulus i|, 
is also called the isothermal compressibility [108]. There is a relationship 
between this compressibility and the permittivity e discussed in Chapter 
2. If we take an insulating material and apply an external electric field, 
a material polarization is established, and energy is stored in this charge 
accumulation. The permittivity is a measure of the charge accumulation 
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per unit volume for a given strength of external electric field, in units of 
It is the ratio of the displacement flux density to the electric field 
intensity 

1^1 


e = 


1^ 


( 8 . 8 ) 


If we take a material and apply an external pressure, the material com¬ 
presses and energy is stored in this compressed volume. The inverse of the 
bulk modulus per unit volume is a measure of the change in volume for a 
given external pressure 


1 _ dY 

(I) 


(8.9) 


TTi 3 

in units of ^ . Both Eqs. 8.8 and 8.9 can be called constitutive rela¬ 
tionships because they describe how a material changes when an external 
influence is applied. 

Multiple other measures describe the variation of a gas, liquid, or solid, 
with respect to variation of a thermodynamic property. The specific heat 
describes the ability of a material to store thermal energy, and it has units 
[109, p. 98]. More specifically, the specific heat over temperature is 

equal to the change in entropy with respect to change in temperature [108]. 
It may be given either assuming a constant volume or assuming a constant 
pressure. 


Specific heat at constant volume = 


Specific heat at constant pressure = 


rr.dS 

= T — 
dT 


rr.dS 

T — 
dT 


( 8 , 10 ) 

( 8 . 11 ) 


The Joule-Thomson coefficient is defined as the ratio of change in tem¬ 
perature to change in pressure for a given total energy of the system 


Joule-Thomson coefficient 


dT 


( 8 , 12 ) 


and it has units ^ [102, p. 685]. When a pressure is applied and overall 
energy is held fixecfbut entropy is allowed to vary, some materials cool and 
others heat. So, this coefficient may be positive, negative, or even zero at 
an inversion point. Additionally, the volume expansivity is defined as 


Volume expansivity 


l^dV 

Y&T 


(8.13) 


[108]. 
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8.4 Ideal Gas Law 

In most materials, if we know three of the four thermodynamic properties, 
volume, pressure, temperature, and entropy, we can derive the fourth prop¬ 
erty as well as other thermodynamic measures. Such materials are called 
simple compressible systems [109, 102]. For such materials, the ideal gas 
law relates pressure, volume, and temperature. 

PV = NMT. (8.14) 

While this is not a mathematical law, it is a good description of gases, and 
it can be used as a rough approximation for liquids and solids. Consider 
a container filled with a gas. If the volume of the container is compressed 
while the temperature is kept constant, the pressure increases. If the gas is 
heated and the pressure is kept constant, the volume increases. The energy 
stored in a gas that undergoes change in volume at constant temperature 
is given by 



where the change in energy is specified by 

AE = PAV. 

The ideal gas law can be written incorporating entropy as 

PV = ST. 


(8,16) 


(8,16) 

(8,17) 


For example, consider a 10 L tank that holds 5 mol of argon atoms. The 
argon gas is at a temperature of T = 15 °C. Find the pressure in the tank 
in pascals and in atm. We begin by converting the volume and temperature 
to more convenient units, V = 0.01 m^ and T = 288.15 K. Next, the ideal 
gas law provides the pressure in Pa. 


P = 


5 mol ■ 8.314 


mol K 


.15 K 


V 0.01 m^ 

Finally, we convert the pressure to the desired units. 

1 atm 


= 1.20 ■ lO*’ Pa 


P = 1.20 ■ lO*’ Pa ■ 


101325 Pa 


= 11.8 atm 


(8,18) 


(8,19) 


As another example, consider a container that holds neon atoms at a 
temperature of T = 25 °C. Assume that the pressure in the container is 10 
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kPa, and the mass of the neon in the container is 3000 g. Eind the volume 
of the container. The temperature is 298.15 K. Erom a periodic table, the 
atomic weight of a neon atom is 20.18 Thus, the container holds 148.7 
mol of neon atoms. Next, we use the ideal gas law to find the volume. 

NMT 148.7 mol ■ 8.314 . 298.15 K 

^ ^ ^ ^ gg ^3 

P 104 Pa ^ 

8.5 First Law of Thermodynamics 

The idea of energy conservation was introduced in Sec. 1.3. Most dis¬ 
cussions of thermodynamics also begin with the same idea. The first law 
of thermodynamics is a statement of energy conservation. Energy can be 
stored in the material polarization of a capacitor, the chemical potential 
of a battery, and in many other forms. People studying thermodynamics 
and heat transfer, however, often make some drastic assumptions. They 
classify all energy conversion processes as heat transfer or other where the 
primary component of the latter is mechanical work. At the beginning 
of introductory thermodynamics courses, all forms of energy besides heat 
transfer and mechanical work are ignored. Charging a capacitor, discharg¬ 
ing a battery, and all other energy conversion processes are grouped in with 
mechanical work when writing the first law of thermodynamics. The first 
law of thermodynamics is typically written as 



(change in int. energy) = (heat in) — (work and other forms). (8.21) 

AU = Q - VP (8.22) 

Each term of the Eq. 8.22 has the units of joules. The symbol Q represents 
the energy supplied in to the system by heating, and —W, with the minus 
symbol, represents the mechanical work in to the system as well as all 
other forms of energy into the system. The quantity AU represents the 
change in internal energy of the system. In a closed system the total energy 
is conserved. In a closed system, energy is either stored in the system 
(for example as potential energy or another form of internal energy), is 
transfered in or out as heat, or is transfered in or out as another form such 
as mechanical work [109, p. 51]. 
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Figure 8.1: Illustration of closed systems containing energy conversion de¬ 
vices. 

As an example, consider the closed system shown on the left part of 
Fig. 8.1 comprised of a cylinder with a piston and a heater. Assume the 
cylinder contains a fixed volume of gas inside. Suppose the heater is used 
to transfer 100 J of energy into the piston in an hour while the piston is 
forced to remain in a fixed position. After the hour, the internal energy of 
the gas will be 100 J greater than before. Again suppose the heater is used 
to transfer 100 J of energy into the gas, but this time assume the piston 
is allowed to move thereby expanding the gas volume. After the hour, the 
internal energy of the gas will be the original energy of the gas, plus the 
100 J supplied by the heater, and minus a factor due to the mechanical 
work done by the piston. 

The first law of thermodynamics says two things. First, energy is con¬ 
served. Second, energy can be stored, converted to mechanical work, or 
converted to heat. We know energy can be converted to other forms too, 
like electrical or electromagnetic energy. While introductory thermody¬ 
namics classes do not usually do so, we can add other devices to the piston 
as shown on the right part of Fig. 8.1. We can include a battery and put a 
resistor inside to convert the chemical energy to electrical energy, and the 
resistor can heat the air in the piston. We can put a mass and a spring in 
the piston and convert potential energy of a compressed spring to kinetic 
energy by removing a clip which holds the spring compressed. In a closed 
system when all energy conversion processes are considered, energy must 
be conserved. 

8.6 Thermoelectric Effects 

8.6.1 Three Related Effects 

In the 1800s, three effects were experimentally observed. At first, it was 
not obvious that these experiments were related, but soon they were found 
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Seebeck Effect 


Peltier Effect 


Thomson Effect 


I 


1 

CM 

1 


hot 


cold 


ARi 


12 


heating/cooling 


hot 


cold 

heating/cooling 


Eigure 8.2: The Seebeck effect, Peltier effect, and Thomson effect. 


to be three aspects of the same phenomenon [5, p. 113]. 

The first effect, now called the Seebeck effect, was discovered in 1821 
by Thomas Seebeck [5, p. 113]. It is observed in a junction of two differ¬ 
ent metals or semiconductors. As discussed in Section 6.5, junctions are 
also used to make photovoltaic devices, LEDs, and semiconductor lasers. 
When the different sides of the junction are held at different temperatures, 
a voltage develops across the junction. The Seebeck coefficient $, in units 
is defined as the ratio of that voltage to the temperature difference. 
More specifically, consider a junction where one side is held at a hotter 
temperature than the other, as shown in the left part of Eig. 8.2. The dif¬ 
ference between the Seebeck coefficient in material one $i and the Seebeck 
coefficient in material two $2 is given by the measured voltage across the 
iunction AVu divided by the temperature difference between the materials 
AT 12 [110, p. 24]. 


$, - $2 = ^ (8.23) 

The difference between the Seebeck coefficients can be positive or negative 
because both the temperature difference and the measured voltage can be 
positive or negative. However, for any given material, the Seebeck coeffi¬ 
cient is positive. To find the Seebeck coefficient for an unknown material, 
form a junction between that material and a material with known Seebeck 
coefficient, heat one end of the junction hotter than the other, and mea¬ 
sure the voltage established. Eor most materials, the Seebeck coefficient is 
less than 1 Some of the largest values of the Seebeck coefficient are 
found in materials containing tellurium. Eor example, (Bio. 7 Sbo. 3)2 Tea has 

$ ?« 230 ^ and PbTe has $ ^ 400 ^ [3]. 

To understand the physics behind the Seebeck effect, consider the flow 
of charges across this diode-like device. In metals, valence electrons are the 
charge carriers, and in semiconductors, both valence electrons and holes 
are the charge carriers. These charge carriers diffuse from the hot to the 
cold side of the junction. Consider a junction of two metals with no net 
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charge on either side initially. If an electron moves from the hot side to the 
cold side, the hot side then will have a net positive charge, and the cold 
side will have a net negative charge. This movement of charges sets up an 
electric field and hence a voltage. 

If we let the sample reach an equilibrium temperature, no voltage will 
be measured. A voltage is measured only during the time when charge 
carriers have diffused from one material to the other but when the material 
has not reached a uniform temperature. Thus, for a material to have a 
large thermoeleetrie ejfeet, it must have a large eleetrieal eonduetivity and 
small thermal eonduetivity. Thermoelectric devices are typically made from 
metals or semimetals because these materials satisfy this condition. 

The second effect was discovered by Jean Peltier in 1834 [5, p. 113]. 
The Peltier ejfeet is also observed in a junction of two different metals, 
semimetals, or semiconductors. It is illustrated in the middle part of Fig. 
8.2. When a current, / in amperes, is supplied across a junction, heat is 
transferred. This effect occurs because charges from the supplied current 
fiow through different materials with different thermal conductivities on 
the different sides of the junction. The effect is quantified by the Peltier 
eoeffieient for the junction, Yii 2 , or Peltier coefficients for the materials 
forming the junction. Hi and 112. More specifically, the Peltier coefficient 
is defined as 

ni2 = Hi - n2 = (8.24) 

in the units of volts [110, p. 24]. The term ^ represents the rate heat is 

transferred in and it may be positive or negative because the thermal 
conductivity in the first material may be higher or lower than in the second 
material. The Seebeck coefficient and the Peltier coefficient are related by 

ffi ~ ^2 — ($1 ~ $ 2 ) T. (8.25) 

PbTe is a material with a relatively high Seebeck coefficient. At room 
temperature, it has coefficients $ = 400 ^ and 

n = 400 ^■300K = 0.12 V. (8.26) 

The third effect was first discovered by William Thomson in the 1860s 
[3]. Thomson also derived the relationship between these three effects. It 
is illustrated on the right part of Fig. 8.2. When a current passes through 
a uniform piece of material which has a temperature gradient, heating or 
cooling will occur, and this result is known as the Thomson ejfeet [3, p. 
148] [110, p. 24] [5, p. 115]. To observe this effect, apply a temperature 
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gradient across a piece of metal or semiconductor and also apply a current 
through the length of the material. Heating or cooling can be measured, 
and this effect is described by another coefficient. The Thomson coefficient 
T also has units It is defined as the rate of heat generated over the 
product of the current and temperature difference. 


r = 


__ 

nn-T,) 


(8.27) 


The Thomson and Seebeck coefficients for a single material are related by 

f fdT = $ (8.28) 

where the integral is over temperature [110, p. 24]. 

These effects work both ways. We can use the Peltier effect, for example, 
to make either a heating or a cooling device. We can supply a current 
across a junction to produce a temperature differential, or we can supply 
a temperature difference to generate a current. All three effects relate to 
the fact that when the electrical conductivity is larger than the thermal 
conductivity, energy can be converted between a temperature differential 
and electricity. As an aside, materials with low electrical conductivity and 
high thermal conductivity are also used to make energy conversion devices. 
Components of motors and generators are often made from layers of metal 
and dielectrics with these properties [111, ch. 8]. 


8.6.2 Electrical Conductivity 


Electrical conductivity a, in units is a measure of the ability of charges 
to flow through a material. Resistivity is the inverse of electrical conductiv¬ 
ity, p = y- Example electrical conductivity values are listed in Table 8.4, 
found in Section 8.3. Eew tools are needed to measure these quantities. 
An ohmmeter can be used to find the resistance R, in ohms, of a sample 
with known length I and cross sectional area A. The conductivity can be 
calculated directly. 



(8.29) 


Electrical conductivity is the product of the number of charges flowing 
and their mobility. Eor conductors, valence electrons are charge carriers 
that flow, so conductivity can be expressed as [9, p. 84] 


a = nqnn 


(8.30) 
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In this expression, n is 
electrons 


the concentration of valence electrons in units 
—, and it was introduced in Sec. 5.2. The magnitude of the charge 
of an electron is g = 1.6 ■ 10“^® C. Mobility of electrons, 


1.6 ■ 10“^^ C. Mobility of electrons, /i„, is the ease 
with which charge carriers drift in a material, and it has units By 

definition, mobility is the ratio of the average drift velocity of electrons to 
the applied electric field in ^ [9, p. 84]. 




-avg drift velocity of electrons 


t 


(8,31) 


For semiconductors, both electrons and holes act as charge carriers, and 
both contribute to the conductivity, 

a = q{nnn+Ptip) (8.32) 

where Pp is the mobility of the holes, and p is the concentration of holes. 

To understand which materials have large electrical conductivities, and 
hence make good thermoelectric devices, we need to consider the charge 
concentrations n and p. Conductors and semiconductors have charges that 
can move through the material while insulators do not. Thus conduc¬ 
tors and semiconductors have large electrical conductivity and are used 
to make thermoelectric devices. Furthermore, a doped semiconductor has 
more charge carriers than an undoped, also called intrinsic, semiconductor. 
Thus, doped semiconductors usually have higher electrical conductivity 
than undoped semiconductors of the same material [110]. 

Electrical conductivity a is proportional to the electron and hole mo¬ 
bilities, pn and Pp, and the mobilities are a strong function of temperature 
[9]. For this reason, the electrical conductivity is a function of temper¬ 
ature. At low temperature, mobilities are limited by impurity scattering 
while at high temperatures, they are limited by phonon scattering. At 
some intermediate temperature, mobility and conductivity are maximum, 
and this peak occurs at different temperatures for different materials. Mo¬ 
bility also depends on whether a material is crystalline or amorphous and 
on the degree of crystallinity. Mobility and electrical conductivity are both 
typically higher in crystals than glasses because charges are more likely to 
get scattered in amorphous materials. 

8.6.3 Thermal Conductivity 

Thermal conductivity k is a measure of the ability of heat to fiow through a 
material, and it has units [109, p. 793]. Example thermal conductivity 
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thermocouples 



heater heat 

sink 


Eigure 8.3: Components used to measure thermal conductivity. 


values are listed in Table 8.4, found in Section 8.3. Understanding thermal 
conductivity is complicated because a number of mechanisms are respon¬ 
sible for the conduction of heat. Heat may be transported by phonons, 
photons, electrons, or other mechanisms, and each mechanism depends on 
temperature and the properties of the material. Good thermoelectric de¬ 
vices have small thermal conductivity. Often metals have large thermal 
conductivity and insulators have small thermal conductivity. 

The apparatus to measure thermal conductivity consists of a heater, a 
heat sink, and a number of thermocouples [110] [112]. To measure thermal 
conductivity experimentally, start with a bar of material with a known 
cross sectional area A. Heat one end of the bar with respect to the other, 
wait for a steady thermal state, and measure the temperature at each end 
of the bar. Next, calculate the temperature gradient ^ in units ^ along 
the length of the bar. Also measure the rate that heat is supplied to the 
bar, in units i. By definition, thermal conductivity is the ratio 


K = 


(power dissipated in heater) (distance between thermocouples) 


(cross sectional area) (change in temp) 


[109, p. 49]. The thermal conductivity can be calculated from 


(8.33) 


dt 

This technique works well for low to moderate temperatures and materials 
with high thermal conductivity. Other methods exist to measure thermal 
conductivity and are advantageous for different temperature or conductiv¬ 
ity ranges. 

Another way to understand thermal conductivity is to think of it as 
the product of the amount of heat transported by some particle times the 
velocity of that particle. This viewpoint applies whether or not actual 
particles are involved in the heat transport. More specifically, thermal 
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conductivity is given by 

K = Cv\~^\l (8.35) 

The symbol represents the specific heat at constant volume in and 

the symbol |^| represents the magnitude of the transport velocity in 
The scattering length is represented by I in m. 

Regardless of whether electrons, phonons, or something else transports 
heat through a material, the ability of that heat to get from one end to 
the other without being scattered or blocked infiuences the thermal con¬ 
ductivity. Thus, crystals typically have higher thermal conductivity than 
amorphous materials [113]. The thermal conductivity of a crystal can be 
lowered by exposing the material to radiation which destroys the crys¬ 
tallinity and increases the likelihood that the heat carrier will be scattered 
[110]. For glasses, scattering length is roughly the interatomic spacing [112]. 
Also, thermal conductivity of glasses is less temperature dependent than 
crystals because high temperatures distort the perfect crystallinity, thereby 
lowering the thermal conductivity for crystals but not glasses [113]. 

All the contributing factors, C„, |n |, and scattering length /, are temper¬ 
ature dependent, so the thermal conductivity is a function of temperature. 
The temperature dependence of the factors is discussed in reference [110]. 
Thermal conductivity, like electrical conductivity, is low at low tempera¬ 
tures then rises to a maximum before decreasing again at higher tempera¬ 
tures [112]. 

8.6.4 Figure of Merit 

The figure of merit of a thermoelectric device, Z, is a single measure that 
summarizes how good a material is for making thermoelectric devices. It 
is defined as 

$^(T 

Z=-, 8.36 

K 

and it has units K“^. It depends on the Seebeck coefficient $, electrical 
conductivity cr, and thermal conductivity k. A large value of Z indicates 
that the material is a good choice for use in construction of a thermoelectric 
device. 

The figure of merit depends on temperature because the parameters 
$, (j, and K are strong functions of temperature. Thus, the best choice ma¬ 
terial for a thermoelectric device operating near room temperature may not 
be the best choice for a device operating at other temperatures. Sometimes 
ZT is used as a measure instead of just Z to account for the temperature 
dependence. 
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The figure of merit does not incorporate all of the temperature related 
factors to consider in selecting materials for thermoelectric devices. Melt¬ 
ing temperature is also important. A thermoelectric device converts more 
energy when a larger temperature difference is placed across the device. 
The Seebeck coefficient is inversely proportional to the temperature differ¬ 
ential, $ = ^. However, too large of a temperature differential will melt 
the hot end of the device, and different materials can have very different 
melting temperatures. Eor example, lead telluride PbTe melts at 924 °C, 
and Bi 2 Te 3 melts at 580 °C [114, p. 4-52, 4-71]. 

The figure of merit also depends on doping level because the electrical 
conductivity is directly proportional to the charge concentrations n and p 
[110]. Thus, a thermoelectric device made from a doped semiconductor has 
a higher electrical conductivity and thermoelectric efficiency than a device 
made from an undoped semiconductor of the same material. The Seebeck 
coefficient is also dependent on doping level but not as strongly [110]. Ther¬ 
mal conductivity is not a strong function of charge concentrations n and p 
[110]. Thus, thermoelectric materials are often made from heavily doped 
semiconductors or from conductors. 

The figure of merit also depends on degree of crystallinity. Typically, 
both the electrical conductivity and thermal conductivity are much higher 
in crystals than glasses because charge and heat carriers are less likely to 
get scattered as they travel through crystals than glasses [113]. Since both 
electrical and thermal conductivity are influenced, the effect of degree of 
crystallinity on the figure of merit can be complicated. 

Thermoelectric devices are typically made from junctions of two dif¬ 
ferent metals or semiconductors. Essentially, a thermoelectric device is a 
diode. Common materials used include bismuth telluride, lead telluride, 
and antimony telluride, all of which are semimetals. Bi, Sb, and Pb are all 
located near each other on the periodic table. Other materials studied for 
use in thermoelectric devices include [110], BiSeTe, LiMnO, LiPeO, LiCoO, 
LiNiO, PbS, and ZnSb. These materials are either small gap semiconduc¬ 
tors or semimetals. In semiconductor materials with small energy gaps, the 
ratio of electrical conductivity to thermal conductivity is large. However, 
this fact must be balanced against the fact that smaller gap semiconductors 
tend to have lower melting temperatures than larger gap semiconductors 

[no, ch. 1]. 

Recently, layered materials and superlattices have been considered as 
materials for thermoelectric devices [115] [116]. The layers can be tailored 
to affect the thermal and electrical properties differently and can act like 
a filter to select out different conduction mechanisms. Understanding of 
the conductivity mechanisms is a prerequisite to understanding such more 
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complicated structures. 

8.7 Thermoelectric Efficiency 

8.7.1 Carnot Efficiency 

Many devices convert a temperature difference to another form of energy. 
For example, thermoelectric devices and pyroelectric devices convert a tem¬ 
perature difference to electricity, and Stirling engines and steam turbines 
convert a temperature difference to mechanical work. There is a funda¬ 
mental limit to the efficiency of any device that converts a temperature 
difference into another form of energy. The Carnot efficiency is the maxi¬ 
mum possible efficiency of such an energy conversion process. 

Consider a thermoelectric device made from a junction of two materi¬ 
als that converts a temperature difference to electricity using the Seebeck 
effect. Assume that one end of the device is connected to a heater, and the 
other end of the device is connected to a heat sink so that it is at a lower 
temperature. The temperature of the hot side of the device is denoted Th, 
and the temperature of the cold side of the device is denoted Tc. Both 
temperatures are measured in kelvins, K (or another absolute temperature 
measure such as Rankine). Assume that the only energy conversion process 
that occurs converts energy from the temperature difference to electricity. 
Furthermore, assume that energy is continuously supplied from the heater 
at a constant rate to maintain the hot end of the device at temperature 
Th- The heater is supplying heat to the room it is in. However, assume 
that the room is so large and the amount of heat from the heater is so 
small that the temperature of the room remains roughly constant. For this 
reason, we say that the room is a thermodynamic reservoir. Also, assume 
that we have waited long enough that the temperature of the device has 
reached a steady state. The temperature is not constant along the length 
of the device, but it no longer varies with time. 

The input to this system is the thermal energy supplied from the heater. 
Bin. The output of this system is the electrical energy extracted out. Bout- 
The device is not used up in the process, so the number of atoms in the 
device remains constant. As long as energy is supplied from the heater 
at a constant rate to maintain the hot side at temperature Th, we can 
extract electrical energy out of the system at a constant rate. Heat transfer 
scientists call this type of process a thermodynamic cycle or a heat engine. 
A thermodynamic cycle is a sequence of energy conversion processes where 
the device begins and ends in the same state. In a thermodynamic cycle, 
energy is supplied in one form and is extracted in another form. The device 
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or mass involved starts and ends in the same state, so the processes can 
continue indefinitely as long as the input is continually supplied. 

How much energy is supplied in to the system from the heater? The 
amount of energy required to maintain the hot side at temperature is 
given by 

Ein = ksTh- (8.37) 

The device is composed of atoms. Each of these atoms has some internal 
energy. A device at temperature T contains ksT joules of energy where 
ks is the Boltzmann constant. Energy fiows from the hot side to the cold 
side of the device. Above, we assumed that the device was in a room that 
was so large that the heat from the heater did not raise the temperature of 
the room. Thus, we must continually supply this energy at a constant rate 
to keep the hot side of the device at temperature Th. While the cold side 
of the device is at a lower temperature Tc, it maintains that temperature 
regardless of the fact that there is a heater in the room. 

How much energy is extracted out of the system as electrical energy? 
In the Seebeck device, the hot side is held fixed at temperature T/^, and 
because of the environment it is in, the cold side remains at temperature Tc. 
Energy is conserved in this system. Thus, the electrical energy extracted 
from the device is given by 


Eout — ksTh — ksTc- 


(8.38) 


What is the efficiency of this system? Above we assumed that no other 
energy conversion processes occur, so this is an idealized case. The result¬ 
ing efficiency that we calculate represents the best possible efficiency of 
a thermoelectric device operating with sides at temperatures Th and Tc. 
Efficiency is defined as 


expression. 


Veff 


Eout 

Veff — p • 

^in 

(8.39) 

. some algebra, we can 

simplify the efficiency 

Eout _ ksTh — ksTc 

(8.40) 

Ein kj^Th 

Th-To 

Veff — rp 

J-h 

(8.41) 

Tc 

Veff = 1 - W 

J-h 

(8.42) 
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8.7 Thermoelectric Efficiency 


Eq. 8.42 is known as the Carnot efficiency. It provides a serious limita¬ 
tion on the efficiency of energy conversion devices which involve converting 
energy of a temperature difference to another form. The Carnot efficiency 
applies to thermoelectric devices, steam turbines, coal power plants, py¬ 
roelectric devices, and any other energy conversion device that convert a 
temperature difference into another form of energy. It does not, however, 
apply to photovoltaic or piezoelectric devices. If the hot side of a device is 
at the same temperature as the cold side, we cannot extract any energy. If 
the cold side of a device is at room temperature, then the efficiency cannot 
be 100%. The Carnot efficiency represents the best possible efficiency, not 
the actual efficiency of a particular device because it is likely that other 
energy conversion processes occur too. We can extract more energy from a 
steam turbine with = 495 K than = 295 K. However, in both cases, 
the amount of energy we can extract is limited by the Carnot efficiency. 
Note that when using Eq. 8.42, Tc and Th must be specified on an abso¬ 
lute temperature scale, where T = 0 is absolute zero. In SI units, we use 
temperature in kelvins. 

As an example, consider a device that converts a temperature difference 
into kinetic energy. The cold side of the device is at room temperature, 
Tc = 300 K. How hot must the hot side of the device be heated to so that 
the device achieves 40% efficiency? 


Tc 

Veff = ^-7fr 
J-h 


(8.43) 


0.4 = 1 


300 


(8.44) 


According to Eq. 8.42, we find that Th = 500 K. 

As another example, suppose we want to convert a temperature dif¬ 
ferential to electrical energy using a thermoelectric device. Assume that 
the cold side of the device is at room temperature of Tc = 72 °E and the 
hot side is at human body temperature of Th = 96 °E . What is the best 
possible efficiency? Eirst the temperatures must be converted from degrees 
Eahrenheit to kelvins. The resulting temperatures are Tc = 295 K and 
Th = 309 K. Next, using Eq. 8.42, we find the best possible efficiency is 
only 4.5%. 

295 


rjcff = 1-= 0.045 

309 


(8.45) 


As another example, assume that the temperature outside on a De¬ 
cember day is = 20 °E and inside room temperature is Th = 72 °C. 
What is the Carnot efficiency of a thermoelectric device operating at these 
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temperatures? Again we begin by converting the temperatures to kelvins, 
Te = 266 K and Th = 295 K. 

266 

Veff = 1 ^ = 0-098 (8.46) 

8.7.2 Other Factors That Affect Efficiency 

The efficiency of practical energy conversion devices is always lower than 
the Carnot efficiency because it is very unlikely that only a single energy 
conversion process occurs. All practical materials, even good conductors, 
have a finite resistance, so energy is converted to thermal energy as charges 
travel through the bulk of the device and through wires connected to it. 
Eurthermore, heat flows through the device, so if a heater is connected to 
one side of a device, the other side will be at a higher temperature than 
the room it is in. Eor this reason, not all energy supplied by the heater can 
be converted to electricity. 

As an example, consider a material with length / = 1 mm = 10“^ m 
and cross sectional area A = 1 mm^ = 10“® m^. Assume the material has 
a resistivity of p = 10“^ f2-m which is typical for a moderate conductor. 
Assume a current of / = 3 mA flows through the sample. How much power 
is converted to heat due to resistive heating? The electrical conductivity 
of the sample is a = 4 = The resistance of the device is given by 

R = ^. Power is 

P = I^R = = (3 ■ 10“^)^ ——= 9 ■ 10“® W. (8.47) 

While this amount of power may seem small, it is another factor which 
diminishes the efficiency of the device. Even if we convert energy from a 
temperature differential to electricity at the junction of the thermoelectric 
device, some resistive heating occurs. This heat is wasted in the sense that 
it isn’t converted back to electricity. 

The efficiency of most thermoelectric devices is less than 10% [5, p. 140] 
|117|. As seen by Eq. 8.42, efficiency depends heavily on the temperatures 
Tc and T/^, and efficiency can be increased by increasing T^. Eor many 
devices, the maximum temperature is limited by material considerations 
including the melting temperature. 
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8.8 Applications of Thermoelectrics 

Thermoelectric devices are used to cool electronics, food, and people. Com¬ 
puter CPUs, graphics cards, and other types of electronics all generate heat, 
and these components can be damaged by excessive heat. Small thermo¬ 
electric devices can increase the reliability and lifetime of such compo¬ 
nents. Thermoelectric refrigerators have been used in RVs and submarines 
[3]. These devices are often less efficient than traditional refrigerators, but 
they can be small and quiet and require low maintenance. Some butter and 
cream dispensers in restaurants use thermoelectric devices to keep perish¬ 
able foods cool [118], and truck-sized thermoelectric refrigerators are used 
to keep pharmaceuticals cool [118]. Engineers have tried making air condi¬ 
tioning units out of these devices [110]. They are better for the environment 
than traditional air conditioning units which require freon or other chem¬ 
icals. However, they are not often used because the efficiencies are a few 
percent at best [110]. Thermoelectric devices have also been incorporated 
into military clothing to keep soldiers cool [118]. 

Thermoelectric devices are used both to make sensors and to control 
the temperature of sensing circuits. A thermocouple is a small thermo¬ 
electric device made from a junction of two materials that is used as a 
temperature sensor. It converts a small amount of energy from a temper¬ 
ature difference to electricity, and it can be used to measure temperature 
very accurately. Thermocouples are very common and often inexpensive. 
Thermoelectric devices are used to cool scanning electron microscopes and 
other types of imaging devices. Cooling is needed when imaging very small 
objects because heat causes atoms to vibrate, which can smear out micro¬ 
scopic images. Liquid nitrogen was used to cool imaging devices before 
thermoelectric devices became available, and it was much less convenient 
to use. The response of many types of sensors depend on temperature. 
A thermoelectric device may be part of a control circuit which keeps the 
sensor at a fixed temperature, so the sensitivity is accurately known. 

Thermoelectric devices are used to generate power for satellites and 
planetary rovers because thermoelectric devices have no moving parts and 
do not require regular refueling. The Mars rover Curiosity is powered 
by NASA’s Multi-Mission Radioisotope Thermoelectric Generator [119]. 
Figure 8.4 illustrates its major components. This power supply contains 
around 10 pounds of plutonium 238 in the form of plutonium dioxide. The 
plutonium decays naturally and produces heat. The heat interacts with a 
thermoelectric device and produces electricity, and the electricity is stored 
in a battery until use. The power supply produces around 2 kW of heat 
and around 120 W of electrical power, so the overall efficiency is around 
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Multi-Mission 

Radioisotope 

Thermoelectric 

Generator 

(expanded view) 


Stack of eight 
GPHS modules 



GPHS (General Purpose 
Heat Source) module 


Thermoelectric module 
(thermocouples) 


Eigure 8.4: Labeled pull-apart view showing the major components of the 
NASA Multi-Mission Radioisotope Thermoelectric Generator. This figure 
is used with permission [120]. 


6% [119]. This technology is not new. The Apollo 12 mission in 1969 used 
a similar type of power supply, but that supply produced only 70 W and 
had a lifetime of 5-8 years. Thermoelectric devices have also been used 
in nuclear power plants as a secondary system to recover some electricity 
from heat produced [5]. 

While thermoelectric effects are often fundamental to the operation of 
sensors and power supplies, the effects are sometimes unwanted [23, p. 457]. 
Electrical circuits contain junctions of wires made out of different metals. 
Such a junction occurs, for example, when an aluminum trace on a printed 
circuit board meets the tin wire of a resistor or when a tin lead solder joint 
meets a copper wire. The Seebeck effect occurs at all of these junctions. 
The Seebeck coefficient at a junction of copper and tin lead solder, for 
example, is 2 ^ [23, p. 457]. These unwanted voltages that develop can 
introduce noise or distortions into sensitive circuits. 

Electrical engineers often think of heat as “wasted energy”. Almost 
every electrical circuit contains resistors which heat up when current flows 
through them. In some applications, this heating is the desirable outcome. 
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8.8 Applications of Thermoelectrics 


For example, some train stations have heat lamps for the use in winter, 
and a concert hall on a winter evening fills with people and heats up from 
the bodies. However, usually the heat is just considered a waste product 
or a nuisance. 

In the long time limit, systems will reach an equilibrium temperature, 
but on short time scales, temperature differentials often exist. The inside 
of a car may be at a hotter temperature than the air outside. The air 
near an incandescent light bulb may be hotter than air elsewhere in a 
room, and so on. At one time in the past, we assumed that the earth 
had a nearly infinite amount of petroleum, coal, and other fossil fuels. 
Today, we know that these resources are finite. Recently, there has been 
increased interest in energy harvesting both for environmental reasons and 
for economic reasons, and thermoelectric devices can be used to convert 
this heat to usable electricity. 
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8.9 Problems 


8.1. In a 1 mm^ volume, 10^® atoms of argon are at a temperature of 
T = 300 K. Calculate the pressure of the gas. 

8.2. Argon gas is enclosed in a container of a fixed volume. At T = 300 K, 
the pressure of the gas is 50,000 Pa. At T = 350 K, calculate the 
pressure of the gas. 

8.3. A balloon is filled with helium atoms at room temperature, 72 °E. It 
has a volume of 5 ■ 10“^ m^, and the gas in the balloon has a pressure 
of 10® How many helium atoms are in the balloon, and what is 
the mass of the gas? 

8.4. A resistive heater is used to supply heat into an insulated box. The 
heater has current 0.04 A and resistance 1 kfl, and it operates for 
one hour. Energy is either stored in the box or used to spin a shaft. 
If the box gains 2,500 J of energy in that one hour, how much energy 
was used to turn the shaft? 

8.5. Qualitatively, explain the difference between each pair of related 
quantities. 

(a) Seebeck effect and Peltier effect 

(b) Thermal conductivity and electrical conductivity 
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8.6. Match the description with the quantity measured. 


A. Electrical conductivity 

B. Peltier coefficient 

C. Seebeck coefficient 

D. Thermal conductivity 

E. Thomson coefficient 

1 . A bar is made from a junction of two metals. A current of 1 
mA is placed through the bar. The temperature at each end of 
the bar is measured as a function of time. The rate of heat 
generated across the bar divided by the current is what 
quantity? 

2 . A bar is made from a junction of two metals. One end of 
the bar is held at a temperature of 20 °C while the other is 
held at 45 °C. The voltage between the ends of the bar is 
measured with a voltmeter. This voltage divided by 15 °C is 
what quantity? 

3. One end of a metal bar of is held at 45 °C while the other 
end is held at 20 °C. A current of 1 mA is placed through the 
bar. The rate of heat generated across the bar is measured. 

The rate of heat generated divided by the product of 1 mA 
and 1 °C is which quantity? 

4. One end of a metal bar of cross sectional area A is heated 
to a temperature of 45 °C. A thermocouple is placed 3 cm 
down the bar away from the heater. The product of the power 
dissipated in the heater times 3 cm divided by the product of 
A and temperature difference measured is what quantity? 

5. A current of 1 mA is put through a metal bar of cross 
sectional area A. The voltage drop across the bar is measured 
with a voltmeter. The current times the length of the bar 
divided by the product of the voltage measured and A is what 
quantity? 


8.7. Explain how to measure each of the following quantities, and list the 
tools needed to make the measurement. 


(a) Electrical conductivity 

(b) Thermal conductivity 

(c) Peltier coefficient 
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8.8. A thermoelectric device has a figure of merit oi Z = 0.7 K“^. A 
second device is made out of the same semiconducting materials, but 
it has been doped so that the electrical conductivity is 20% higher. 
Eind the figure of merit of the second device. 

8.9. A thermoelectric device is made from a material with resistivity 5 ■ 
10“® O-m and Seebeck coefficient 8.5 ■ 10“^ A cube, 1 cm on each 
side, was used to determine the thermal conductivity. One side of the 
cube was heated. At a steady state, the rate of energy transfer by 
conduction through the cube is 1.8 W. The temperature distribution 
through the material is linear, and a temperature difference across is 
measured to be 20 K across the cube. Eind the thermal conductivity 
K, and find the figure of merit Z for the material. 

8.10. As shown in Eig. 8.3, a heater supplies heat to one side of an iron rod. 

The rod is cylindrical with length 30 cm and radius 2 cm. The heater 
supplies 2 W of power to the edge of the rod. Iron has a thermal 
conductivity of k = 80 Two thermocouples are are spaced 15 

cm apart as shown in the figure. What is the difference in temperature 
(in degrees Celsius) measured between the two thermocouples? 

8.11. A thermoelectric device is used to build a small refrigerator that can 
hold two pop cans. When the device is operating, the cold side of 
the device is at T = 10 °C while the hot side of the device, outside 
the refrigerator, is at T = 42 °C. What is the maximum possible 
efficiency of this device? 

8.12. The cold side of a thermoelectric device, used to generate electricity, is 
at a temperature of 100 °C. What is the minimum temperature of the 
hot side of the device needed to achieve an efficiency of ? 7 e// = 15%? 

8.13. The Carnot efficiency describes the limit of the efficiency for some 
devices. Does it apply to the following types of devices? (Answer yes 
or no.) 


• Hall effect device 

• Semiconductor laser 

• Photovoltaic device 

• Piezoelectric device 

• Pyroelectric device 

• Thermoelectric device used as a temperature sensor 
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• Thermoelectric device used as a refrigerator 

• Thermoelectric device used to generate electricity for a sensor 
system 

8.14. The figures show Seebeck coefficient $(T), electrical resistivity p(T) = 
and thermal conductivity n(T) plotted versus temperature T for 
a family of materials known as skutterudites. These materials have 
the composition Tla;Co 4 _j^Fej^Sbi 2 and Tlj;Co 4 Sbi 2 -j,Sny where x and 
y range from zero to 1. The figures used with permission from ref¬ 
erence [121]. Recently, scientists have been studying the possibility 
of making thermoelectric devices from these materials. Using the 
data in the figures, approximate the thermoelectric figure of merit Z 
in units at a temperature of T = 200 K for the material with 
X = 0.1. 
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Eigure 8.5 

8.15. Consider the data in the figures of the previous problem over the range 
50 < T < 300 K for the material with x = 0.1. At what temperature, 
within this range, is the product of the figure of merit times the 
temperature, ZT, the largest, and what is the corresponding value of 
ZT? Show your work. 

8.16. The figures show the Seebeck coefficient, electrical resistivity, and 
thermal conductivity for three different materials. Assume that we 
would like to use these materials to build thermoelectric devices which 
generate electricity where the cold side of the device is slightly below 
T 400 K, and the hot side is slightly above T 400 K. The figures 
are used with permission from reference [122]. 


(a) Approximately, calculate the Peltier coefficient and the Thom¬ 
son coefficient for CeEe 4 Asi 2 near T ^ 400 K. 

(b) Assume you have a cube of CeEe 4 Asi 2 , 1 cm on each side. What 
is the resistance R of the cube? 

(c) What is the thermoelectric figure of merit Z for CeEe 4 Asi 2 near 
T 400 K. Include units in your answer. 

(d) All else equal, which of the three materials would produce the 
largest voltage for a given temperature difference. Justify your 
answer. 

(e) Which of the three materials has the largest thermoelectric fig¬ 
ure of merit? Justify your answer. 
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9 Batteries and Fuel Cells 

9.1 Introduction 

This chapter discusses two related energy conversion devices: batteries 
and fuel cells. A battery is a device which converts chemical energy to 
electricity, and one or both of the electrodes of the battery are consumed 
or deposited in the process. A fuel eell is a device which converts chemical 
energy to electricity through the oxidation of a fuel. The fuel, but not the 
electrodes, is consumed in the operation of a fuel cell. Oxidation is the 
process of losing an electron while reduetion is the process of gaining an 
electron. Both batteries and fuel cells contain three main components: an 
anode, cathode, and electrolyte. The electrode which electrons flow toward 
is called the eathode. The electrode which electrons flow away from is called 
the anode anode. The eleetrolyte is a material though which ions can flow 
more easily than electrons. 

In many ways, current technology is limited by battery technology. Eor 
example, the battery of the Apple iPhone X weighs 42 g and has a specific 
energy of 246 It accounts for 24 % of the weight of the phone [123] 

[124]. Similarly, the batteries of the Tesla Model S electric vehicle weigh 
580 kg and have an overall specific energy of 141 They account for 

27% of the weight of the car [125]. Relatedly, technology companies have 
been rocked by problems in battery manufacture. In July of 2015, more 
than half a million hoverboards produced by ten different companies were 
recalled due to battery explosions [126]. Also, Samsung recalled millions 
of Galaxy Note 7 smart phones in 2016, costing the company billions of 
dollars [127]. The batteries were manufactured by one of two different 
suppliers. Manufacturing issues in batteries produced by both suppliers 
made the phones susceptible to catching on Are [127]. 

Due to the importance of battery technology to the consumer product 
industry, electric vehicle industry, and other technology sectors, money and 
effort have been pouring into battery research, development, and manufac¬ 
turing. Rechargeable lithium ion battery development, in particular, is an 
intense area of effort and investment. All of the examples in the previ¬ 
ous paragraph involve these lithium batteries. In 2009, $13 billion worth 
of lithium batteries were sold, and 163 billion lithium batteries were pro¬ 
duced [128, ch. 15]. In 2014 Tesla, one of multiple manufacturers, began 
construction of a new factory named the Gigafactory. Upon completion, 
Tesla aims for this facility to be the largest building in the world and for 
it to annually produce lithium batteries with a combined capacity of 35 
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gigawatt hours [129]. More recently, industry-wide investment has only 
grown larger. 

9.2 Measures of the Ability of Charges to Flow 

The idea of flow of charges is fundamental to both electrical engineering 
and chemistry. However, electrical engineers and chemists make different 
assumptions, and they use different notations to describe closely related 
phenomena. Engineers prefer to work with solids because solids are durable. 
Electrical engineers assume all discussions involve solids unless otherwise 
specified. Chemists, however, are quite interested in, and assume all dis¬ 
cussions involve, liquids, with special focus on aqueous solutions. Batteries 
and fuel cells typically involve charge flow through both liquids and solids, 
so to understand these devices, we have to be familiar with notations and 
assumptions from both fields of study. 

In solid conductors, valence electrons flow. Inner shell electrons are 
assumed to be so tightly bound to atoms that their movements can be 
ignored. Nuclei are so much heavier than electrons that their movements 
can also be ignored. In solid semiconductors, both valence electrons and 
holes flow. Electrical engineers measure the ability of charges to flow in 
materials by the electrical conductivity. 

Positive and negative ions can flow more easily in liquids than solids, so 
chemists are concerned with the flow of both electrons and ions. Semicon¬ 
ductor physicists tend to use the terms electrical conductivity, resistivity, 
Eermi level, and energy gap. Chemists are so interested in the ability of 
charges to flow that they have many interrelated measures to describe it. 
We’ll discuss the following measures: 

• Mulliken electronegativity 

• Ionization energy 

• Electron affinity 

• Electronegativity 

• Chemical potential 

• Chemical hardness 

• Redox potential 

• pH 
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Solid Semiconductor Notation 
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Eigure 9.1: An energy level diagrams labeled in two ways. 


9.2.1 Electrical Conductivity, Fermi Energy Level, and Energy 
Gap Revisited 

Electrical conductivity a is measured in units and it was discussed in 
Sec. 8.6.2. The inverse is resistivity p = y measured in D ■ m. Electrical 
conductivity and resistivity are measures of the ability of electrons to flow 
through a material. As described by Eq. 8.32, electrical conductivity is 
directly proportional to the number of charges present and the mobility of 
the charges. Conductors have large electrical conductivity, and insulators 
have small electrical conductivity. These measures can describe liquids and 
gases as well as solids. Also, gases, liquids, and solids can all be classified 
as conductors, dielectrics, or semiconductors. 

Eermi energy level, energy gap, valence band, and conduction band 
were defined in Section 6.3. The left part of Eig. 9.1 shows an energy level 
diagram zoomed in so that only some levels are shown, and these terms 
are illustrated in the figure. At T = 0 K, energy levels are filled up to 
some level called the valence band. The energy level above it, which is 
unfllled or partially unfllled, is called the conduction band. The amount of 
energy needed to completely remove an electron from the valence band is 
represented by the vertical distance from that energy level to the ground 
state, labeled 0 eV, at the top of the figure. The energy gap, Eg, is the 
vertical distance between the valence and conduction bands. It represents 
the minimum amount of energy needed to excite an electron. The Eermi 
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energy level represents the energy level at which the probability of finding 
an electron is 0.5. At T = 0 K, it is at the middle of the energy gap. 
In the figure, it is shown as a dotted fine. Qualitatively, it represents 
the amount of energy needed to remove the next electron. No electrons 
have exactly that energy because there are no allowed states in the gap. 
For a doped semiconductor, a semiconductor with crystalline defects, or 
a semiconductor not at absolute zero temperature, the Fermi level is near 
but not quite at the middle of the gap. 

The right part of the figure shows the same energy level diagram la¬ 
beled using terms more commonly used by chemists to describe isolated 
atoms than by physicists to describe solid semiconductors. Chemists some¬ 
times use the term highest occupied energy level instead of valence band. 
This term is most often used to refer to energy levels of isolated atoms or 
molecules because some authors reserve the term band for an energy level 
shared between neighboring atoms. Similarly, chemists sometimes use the 
term lowest unoccupied energy level in place of conduction band. As dis¬ 
cussed below in Secs. 9.2.3 and 9.2.4, the term chemical potential fXchem 
is used in place of Fermi energy level Ef, and the energy gap Eg may be 
called twice the chemical hardness. 


9.2.2 Mulliken Electronegativity 


One measure that chemists use to describe the ability of charges to fiow is 
electronegativity, and this term has multiple definitions in the literature. 
One definition is by Mulliken in 1934 [130], and this measure will be referred 
to as the Mulliken electronegativity. Mulliken approximated the energy in 
a chemical bond by averaging the ionization energy Aomz and the electron 
affinity Aaff. Mulliken eleetronegativity is defined 

_ lioniz T Aajj /n 1 ^ 

X,Mulliken ^ 


Ionization energy is the energy needed to remove an electron from an atom 
or ion, and eleetron affinity is the energy change when an electron is added 
to an atom or ion [12]. All of these quantities, XMuiiiken, honizi and Aaff, 


are measured in the SI units of or occasionally in other units like 

eV _ kJ 


___ QJ’ __ 

This c^Snition is simpler than other definitions of electronegativity, and 
reference [131] calls this an “operational and approximate” definition. It is 
useful because it involves strength of chemical bonds, and we can relate it to 
the measures used by semiconductor researchers. Qualitatively, ionization 
energy is represented by the energy needed to rip off an electron. In Fig. 
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9.1, it is the vertical distance from the valence band or highest occupied 
state to the ground state at the top of the figure. Sometimes chemists 
call this amount of energy the work function instead [60, ch. 6] [108]. In 
Eig. 9.1, the electron affinity is represented by the vertical distance from 
the conduction band or lowest unoccupied state to the ground state at the 
top of the figure. The magnitude of the Mulliken electronegativity is the 
average of these two energies, so it is the magnitude of the Eermi energy 
at T = 0 K. By convention, it has the opposite sign. 

\Mulliken ■^/It=0 K (^■^) 

Eundamentally, electrical engineering is the study of flow of charges. 
Chemistry is the study of the strength of chemical bonds. The electrical 
conductivity of a material is high when the chemical bonds holding that 
material together are easily broken so that many free charges can flow. 
The electrical conductivity of a material is low when chemical bonds hold¬ 
ing atoms together require lots of energy to break. Electronegativity is a 
measure of the energy required to break chemical bonds, so fundamentally, 
it tells us similar information to electrical conductivity. 


9.2.3 Chemical Potential and Electronegativity 


Another way of defining electronegativity follows the definition introduced 
by Pritchard in 1956 [132]. This definition is one of the more common ones, 
and it is used by both chemists [131] [133] and by other scientists [2, p. 
124.]. The electronegativity of an atom is defined as 


X = - 





(9.3) 


where U is the internal energy relative to a neutral atom and N repre¬ 
sents the number of electrons around the atom. An atom is composed of a 
charged nucleus and charged electrons moving around the nucleus, so there 
is an electric field, and hence an electrical potential V in volts, around 
an atom. This potential significantly depends on the number of electrons 
around the atom. Also, when the atom is at a temperature above abso¬ 
lute zero, the electrons and nuclei are in motion, so the atom has some 
entropy S. Electronegativity involves at constant electrical potential 
and entropy. It applies whether the atoms are part of a solid, liquid, or 
gas. 

The chemical potential fichem is defined as the negative of this elec¬ 
tronegativity. 

fJ^chem = -X (9.4) 
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In SI units, both chemical potential and electronegativity are measured 
in but sometimes they are also expressed in or As if 

the three names, chemical potential, negative of the electronegativity, and 
Fermi energy level, weren’t enough, this quantity is also known as the 
partial molar free energy [60, p. 145]. 

Electronegativity is used to describe a collection of atoms, molecules, 
or ions all of the same ionization state [131]. Less energy is required to 
rip the first electron off an atom than the second or third electron. The 
definition of electronegativity is specific to potential V, in volts, due to the 
nucleus and electrons around an atom. For example, we can talk about 
the electronegativity, energy required to rip off the electron, of a neutral 
magnesium atom. We can also talk about the electronegativity, energy 
required to rip off an electron, from a Mg’*" ion. The electric field, and 
hence potential V, around a neutral Mg atom and the electric field, and 
hence potential V, around a magnesium ion Mg’*" are necessarily different 
because of the number of electrons present. The energies required to rip 
off the next electron from these atoms will also necessarily be different. 
So, electronegativity of a material always refers to a specific ionization 
state. Electronegativity incorporates both the energy required or gained 
by ripping off an electron and the energy required or gained by acquiring an 
electron. Qualitatively, it is the average of the ionization energy required 
to rip off an electron and the electron affinity released when an electron 
is captured. In the case of the Mg atom from the example above, the 
energy gained by releasing an electron is the significant term, but that is 
not always the case. 

In most energy conversion devices, and most chemical reactions, we are 
interested in only the valence electrons. So, even if an atom has dozens 
of electrons around it and the energy to rip off each electron is different, 
we are just interested in the first few valence electrons. We will see that 
batteries and fuel cells involve energy stored in chemical bonds. Only 
the valence electrons are involved in the reactions of batteries and fuel 
cells, so in studying batteries and fuel cells, we are most interested in the 
electronegativity of neutral or singly ionized atoms. 

Equation 9.3 defines electronegativity as the energy required to rip off 
the next electron from the atom. Again consider Eig. 9.1. The energy level 
known as the valence band to semiconductor physicists and the highest 
occupied state to chemists is filled with electrons. The next highest band, 
called either the conduction band by semiconductor physicists or lowest 
unoccupied state by chemists, is not filled with electrons. The electronega¬ 
tivity according to this definition is the energy required to rip off the next 
electron. On average, it is again graphically represented by the Eermi level. 
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Both electronegativity defined by Eq. 9.3 and Mulliken electronega¬ 
tivity defined by Eq. 9.1 have the same units. However, multiple other 
definitions of electronegativity can be found in the literature. One of the 
oldest definitions is due to Pauling in 1932 [134], and that definition is 
measured instead in the units of square root of joules on a relative scale. 
Reference [135] expanded on Pauling’s definition to show variation with 
ionization state and atom radius. Reference [133] also contains a different 
definition of electronegativity also with its own units. 

9.2.4 Chemical Hardness 

Chemists sometimes use the term hardness when semiconductor physicists 
would use the term half the energy gap. Chemical hardness has nothing to 
do with mechanical hardness. As with electronegativity, there are multiple 
related definitions of hardness. The Mulliken hardness is defined as [131] 

Mulliken hardness = ^ (9.5) 

A more careful definition of chemical hardness is [131] [136, p. 93] 

hardness = i (5^) , (9.6) 

It is half the change in chemical potential for the next electron, and quali¬ 
tatively it is represented by half the energy gap. As with electronegativity, 
it is specified for a given potential in volts around the atom and a given 
entropy. Liquids may be classified as hard or soft. Hard acids and hard 
bases have large energy gaps, so they are electrical insulators. Soft acids 
and soft bases have small energy gaps, so they are electrical conductors. No 
additional variable will be introduced for hardness because this quantity 
can be represented by half the energy gap, 

9.2.5 Redox Potential 

Redox (from REDuction-OXidation) potential Vrp is yet another measure 
used by chemists to describe the ability of electrons to be ripped off their 
atoms and flow in the presence of an applied voltage, nearby chemical, op¬ 
tical field, or other energy source. As defined above, the process of ripping 
off electrons is called oxidation. The process of gaining electrons is called 
reduction. Together, they form redox reactions. Instead of being measured 
in joules like electronegativity, it is measured in volts where a volt is a joule 
per coulomb. Redox potential represents the energy stored in a chemical 
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bond per unit charge. It is more often used by experimentalists than the¬ 
orists, and it is often used to describe solids instead of liquids. Redox 
potential is a macroscopic property, describing a larger piece of material as 
opposed to describing just an individual atom. It is also sometimes called 
oxidation reduction potential or the standard electrode potential [137]. It 
is a relative measure of the ability of a substance to lose an electron. A 
list of redox potentials can be found in references [60, p. 158] and [137]. 
There are different ways of defining redox potential in the literature. The 
definitions vary in their choice of a ground reference voltage, and they vary 
in their sign conventions. American and European researchers tend to use 
different definitions. 

Redox potential is measured on a relative scale. To measure redox 
potential [138], electrodes are put in the system being studied. A potential 
is applied to balance the internal voltage. By measuring this externally 
applied voltage, the potential of an electrode is determined with respect 
to a reference electrode. Often, the potential of a platinum electrode is 
used as a reference and said to have zero volts at standard conditions of 
T = 25 °C and P = 1 atm. The reaction at the platinum electrode is 

H2^2H+ + 2e-. (9.7) 


9.2.6 pH 

pH is a unitless measure of the likelihood that a water molecule is bonded 
or has been ionized in a liquid solution. It is used to classify liquids as 
acidic or basic. When discussing pH, we assume the material under test is 
a liquid solution at a temperature of 25 °C and a pressure of 1 atm [12] 
[81]. A liquid solution is a mixture of water and another material called a 
solute. More specifically, pH is defined as 


pH 



(9.8) 


The quantity [H^] is the amount concentration of hydrogen ions in the 
units of [68, p. 39]. 

concentration H'''ions, mol , , 

[H’'"] =_^_ (9.9) 

^ volume of solution in L 

This quantity was formerly called molarity or molar concentration, but 
these terms are no longer recommended for use [68, p. 39]. pH is a measure 
often used by experimentalists. 
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The concept of pH is fundamentally related to the flow of charges, a 
concept which is very important to electrical engineers. Water is composed 
of H 2 O molecules. In pure water, some of these molecules fall apart, ionizing 
into H’*' ions (protons) and OH“ ions. However, most of the molecules 
remain intact. If some solutes are mixed with the water, more of the H 2 O 
molecules will ionize than in pure water. Eor example, carbon dioxide will 
bond with OH~ ions forming carbonic acid HCO 3 causing an increase in 
H'^ ions. Since pH is the negative log of H^ ion activity, increasing H^ ion 
concentration is equivalent to a pH decrease. If ammonia, NH 3 , is added to 
water, NHj is formed, and the number of OH“ ions increases resulting in 
an increase in pH. Since water is a liquid, both these positive and negative 
ion charge carriers can move about relatively easily. If an external voltage 
is applied across the liquid, ions will flow. Electrical conductivity will be 
higher in a liquid with more ions present than in liquids with fewer ions 
present. 

As an example, consider what happens when neutral sodium atoms are 
added to water. (Eor obvious reasons, don’t try this at home [139].) It is 
energetically favorable for the sodium atoms to ionize to Na’*' giving up an 
electron. In the process, some more water molecules ionize, and some H"*" 
ions become neutral H atoms. 

Na + H 20 ^Na+ + H + OH" (9.10) 

By adding the solute sodium, the solution has fewer H’*' ions. 

Consider what happens when neutral chlorine atoms are added to water. 
It is energetically favorable for a chlorine atom to acquire an electron to a 
form Cl~ ion. 

Cl + H20^Cr+H+ + OH (9.11) 

By adding the solute chlorine, the solution has more H"^ ions. While these 
examples involve adding neutral atoms, the concept of pH applies to solutes 
which are molecules too. 

Solutions with pH less than 7 are called acidic. If a solution has a high 
concentration of H^ ions, it will have a low pH and be acidic. In strongly 
acidic solutions, molecules of the solute rip apart many water molecules, 
so lots of ions are present. Solutions with pH greater than 7 are called 
alkaline or basic. If a solution has a low concentration of H^ ions, and 
hence a high concentration of OH“ ions, it will have a high pH and be 
alkaline. In strongly alkaline solutions, molecules of the solute rip apart 
many water molecules, so again lots of ions are present. Neutral solutions 
have a pH near 7, and some neutral solutions may be electrical insulators. 
Solutions with a pH much below or much above 7 necessarily have many 
ions present, and they are good electrical conductors. 
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As an example, let’s find the pH of a solution with 10^^ ions of H"'’ in 1 
L of water. 


H+ = 


10 ^^ ions 


1 L 6.022 ■ 1023 lOfif 
mol 


= 1.66 ■ 10 


-9 


mol 


(9,12) 


Notice that the exponent of [H’*'] is -9, and the pH is 9. Qualitatively, the 
pH tells us the negative of the order of magnitude of the amount concen¬ 
tration of hydrogen ions. The solution in this example is alkaline. 

As a related example, let’s find the pH of a solution with 10^° ions of 
H'*' in 1 L of water. 



10 ^° ions 

n 


1 

6.022 ■ 1023 


ions 

mol 


1.66 ■ 10 “^ 


mol 

IT 


(9.14) 


pH = log f- - -T ) 4 (9.15) 

^ ^Vl-66-lO-V 

This example has more hydrogen ions in the solution, so it is more acidic. 
The pH of 4 tells us that the solution has approximately 10“^ of 
hydrogen ions. 

How many hydrogen ions are found in a 1 L solution with a pH of 7? 


10“^ — ■ 1 L ■ 6.022 ■ 10^^ 


7 = log 

[H+] = 10 
ions 


|Hl) 

(9.16) 

_j ions 
mol 

(9.17) 

= 6.022 ■ 10^® ions H+ 

(9.18) 


L mol 

A neutral solution, with a pH of 7, still contains H^ ions. 



9 BATTERIES AND EUEL CELLS 


211 


9.3 Charge Flow in Batteries and Fuel Cells 
9.3.1 Battery Components 

The flow of both positive and negative charges must be considered to un¬ 
derstand the operations of batteries and fuel cells. The simplest battery 
contains just an anode, cathode, and electrolyte. These components are 
illustrated in Eig. 9.2. 



Eigure 9.2: Battery components. 

Both of the electrodes must be good conductors. They are often porous 
to increase the surface area where the reaction occurs. The cathode is a 
sink for electrons and positive ions, and both of these types of charges are 
attracted towards this terminal. The cathode is the positive electrode of 
a discharging battery. The anode is source for electrons and positive ions, 
and both of these types of charges flow away from the anode. The anode 
is the negative electrode of a discharging battery. 

The electrolyte has high ionic conductivity but low electrical conductiv¬ 
ity. Eor this reason, during discharge of a battery, ions flow from the anode 
to the cathode through the electrolyte. Meanwhile, electrons are forced 
to flow from the anode to the cathode through the load. The electrolyte 
is often a liquid but sometimes a thin solid. Batteries are contained in a 
package. If the electrolyte is liquid, a seal is included to prevent it from 
spilling or escaping [140]. 

Most batteries also contain a separator, which is typically made from 
a thin polymer membrane [140]. The separator allows some but not other 
ions to flow through, and it is a physical barrier that prevents the electrodes 
from contacting and shorting out the battery. 
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Battery components 


Optional components for protection 

Anode 


Diode 

Cathode 


Fuse or circuit breaker 

Electrolyte 


Vent 

Separator 


Microcontroller 

Seal 


Thermocouple 

Table 9 

1 .1: 

Battery Components. 


Additional components are often added to improve device safety, and 
Table 9.1 lists some of these optional components. A user may mistakenly 
insert a battery backwards. To prevent damage due to this error, some bat¬ 
teries incorporate a diode [128, ch. 5.1]. The voltage across the terminals 
of a battery with an internal diode will necessarily be less than the voltage 
across an equivalent battery without the diode present. Other batteries, 
like typical 9V batteries, incorporate connectors that can only be attached 
one way. A battery may also be damaged if the terminals are shorted. 
Most batteries include vents so gases can safely escape when a battery is 
damaged due to shorting the terminals, attempting too much current draw, 
or overheating for other reasons [128, ch. 5.1]. Some batteries include a 
fuse or circuit breaker in the package to prevent damage in these cases too. 
Additionally, rechargeable batteries can be damaged if the recharging pro¬ 
cess is not properly controlled [128, ch. 5.1]. Some rechargeable batteries 
have a thermocouple and microcontroller built into the package to control 
the recharging process and prevent overheating during recharging [128, ch. 
5.1]. Users should not try to recharge nonrechargeable batteries. While the 
chemical reaction can often go in either direction, the package and structure 
of a primary battery are not designed to withstand the charging process 
and will typically be damaged [128, ch. 5.1]. 

9.3.2 Charge Flow in a Discharging Battery 

As a battery discharges, chemical energy stored in the bonds holding to¬ 
gether the electrodes is converted to electrical energy in the form of current 
flowing through the load. Consider an example battery with a magnesium 
anode and a nickel oxide cathode. The reaction at the anode is given by 

Mg + 20H-^ Mg(OH)2 + 2e- (9.19) 

which has a redox potential of Vrp = 2.68 V [137] [140]. The reaction at 
the cathode is given by 

NiOa + 2 H 2 O + 2e“ ^ Ni(OH )2 + 20H~ 


(9.20) 
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Eigure 9.3: Charge flow in a discharging battery. 

which has a redox potential of Vrp = 0.49 V [140]. The overall reaction is 
given by 

Mg+Ni02 + 2 H 2 O ^ Mg(OH)2+Ni(OH)2 

This reaction occurs in alkaline solutions that contain OH“ ions available 
to react, so an electrolyte such as potassium hydroxide, KOH, can be used 
[140]. Other reactions may simultaneously occur at these electrodes [137], 
but for simplicity these other reactions will be ignored. 

Eigure 9.3 illustrates the charge flow in the battery during normal oper¬ 
ation. A complete circuit is formed not just by the flow of electrons but by 
a combination of the flow of electrons and ions [128]. Electrons flow away 
from the negative terminal (anode) through the load. Negative OH~ ions 
flow away from the positive terminal (cathode) through the electrolyte. 
The separator should allow the OH“ to flow from the positive terminal to 
the negative terminal. Eor some electrodes, though not in this example, 
positive ions, instead of negative ions, complete the circuit by flowing away 
from the negative terminal. As shown in the figure, the direction of current 
flow is opposite to the direction of electron flow. The battery continues to 
discharge until one of the electrodes is used up [3, p. 226]. 

9.3.3 Charge Flow in a Charging Battery 

Eigure 9.4 illustrates the flow of charges when the battery is charging. Dur¬ 
ing charging, energy is converted from electrical energy due to the external 
voltage source back to chemical energy stored in the chemical bonds hold¬ 
ing together the electrodes. Again, the flow of both electrons and ions, not 
just electrons, must be considered. As above, the direction of the current 
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NiO , 



Figure 9.4: Charge flow in a charging battery. 

is the opposite of the direction of the flow of electrons. Reactions occurring 
are the opposite of the reactions given by Eqs. 9.19 and 9.20. By deflnition, 
the cathode is the electrode which electrons flow towards, and the anode 
is the electrode which electrons flow away from. During charging, unlike 
during discharging, the cathode is the negative terminal and the anode is 
the positive terminal. For this example, the reaction at the cathode is 

Mg(OH)2 + 2e-^ Mg + 20H" (9.21) 

and the reaction at the anode is 

Ni(OH )2 + 20H- ^ NiOs + 2 H 2 O + 26“. (9.22) 

In this example, OH' ions flow away from the cathode during charging. 
However, in some reactions, both the flow of negative ions away from the 
cathode and positive ions away from the anode must be considered during 
charging. 

9.3.4 Charge Flow in Fuel Cells 

A fuel cell contains many of the same components as a battery [3, p. 226] 
[128, p. 376] [141]. Like a battery, a fuel cell contains an anode and a 
cathode. These electrodes must be good conductors, and they are often 
porous so that they have a large surface area. Electrodes are in a liquid 
or solid electrolyte through which ions can flow. The electrodes are often 
coated in a catalyst, such as platinum, to speed up chemical reactions 
|141|. A fuel cell contains a separator, typically called a membrane, which 
selectively allows ions to flow. As with the separator of a battery, it is 
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Fuel 



" Oxidizer 

► Oxidation 
Products 


Eigure 9.5: Charge flow in a fuel cell. 


typically made from a thin polymer. Enel is added at the anode, and an 
oxidizer is added at the cathode. Typically, both the fuel and oxidizer 
are liquids or gases. They get consumed during operation while the anode 
and cathode are not consumed as they are in a discharging battery. These 
components are illustrated in Eig. 9.5. 

As an example, a fuel cell may use H 2 gas as the fuel and O 2 gas as the 
oxidizer. The anode may be carbon cloth [141], and this reaction is sped 
up by a platinum catalyst [108]. An alkaline solution such as KOH can be 
the electrolyte. Eor this fuel in an alkaline electrolyte, the reaction at the 
anode is 

H 2 + 20H~ ^ 2 H 2 O + 2e“ (9.23) 

and the reaction at the cathode is 

Aj + 2 e- + H2O ^ 20 H- ( 9 . 24 ) 


| 108 |, 

Eigure 9.5 also illustrates charge flow in an example fuel cell [3, p. 226] 
[128, p. 376] [141]. Oxidation, the process of ripping electrons off the fuel 
leaving positive ions, occurs at the anode. These electrons flow from the 
anode to the cathode through the load. At the cathode, the oxidizer is 
reduced. In other words, at the cathode, the oxidizer reacts incorporating 
these electrons to form negative ions. These negative ions flow from the 
cathode to the anode, and positive ions flow from the anode to the cathode. 
The membrane prevents charges from flowing in the reverse direction, and 
it prevents the positive ions and negative ions from combining with each 
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other directly. A fuel cell can continue to operate as long as the fuel and 
oxidizer are added and the oxidation products are removed. 


9.4 Measures of Batteries and Fuel Cells 

9.4.1 Cell Voltage, Specific Energy, and Related Measures 

Just as chemists have multiple measures of the ability of charges to flow, 
they have multiple measures of energy or charge stored in a device. In this 
section, the following measures of batteries and fuel cells are deflned: 


Cell voltage in volts 
Speciflc energy in g or 

Energy density in ^ or 
Capacity in mA-h or C 
Speciflc capacity in or 


Charge density in "“V" “ or ^ 


mA-h C 
L 


kg 


Deflnitions throughout this section follow references [128, ch. 1] and [140]. 

If these measures are calculated using knowledge of chemical reactions 
and quantities found in the periodic table, they are called theoretical values. 
If these quantities are experimentally measured, they are called practical 
values. Practical values are necessarily less because no energy conversion 
device is ever completely efficient. Measures preceded by the word specific 
are given per unit mass. Measures followed by the word density are give 
per unit volume. For example, specific energy is measured in the SI units 
of joules per gram and energy density is measured in the SI units of joules 
per meter cubed. However, these rules are not closely followed, so the term 
energy density is sometimes used to mean energy per unit weight instead 
of per unit volume. It is safest to explicitly specify the units of measure to 
avoid this confusion. 

Theoretical cell voltage^ Vceii measured in volts, is the voltage between 
the anode and the cathode in a battery or fuel cell. It is the sum of the 
redox potential for the half reaction at the anode and the redox potential 
for the half reaction at the cathode. It represents the voltage between the 
terminals of a completely charged battery or fuel cell. Many authors call 
this measure theoretical cell potential instead of cell voltage, and symbols 
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or are also used in the literature. As discussed in Appendix C, the 
word potential is overloaded with multiple meanings. The word voltage 
and the symbol Vceii are used here to emphasize that this quantity is essen¬ 
tially voltage. Since redox potentials for many half reactions are tabulated 
[128, app. B] [137], theoretical cell voltage can be quickly calculated for 
many reactions. While we can calculate the theoretical cell voltage, we can 
measure the practical cell voltage with a voltmeter. The theoretical cell 
voltage will always be slightly larger than the practical cell voltage because 
the theoretical cell voltage ignores a number of effects including internal 
resistance and other factors discussed in the next section. Reactions with 
Vceii > 0 occur spontaneously [12, ch. 18]. 

Three related measures are capacity, specific capacity, and charge den¬ 
sity. Capacity is measured in ampere hours or coulombs. (By definition, 
one ampere is equal to one coulomb per second.) It is a measure of the 

charge stored in a battery or fuel cell. Specific capacity is a measure of 

m A Vi 

the charge stored per unit mass. It is specified in g , ^ , or related 
units. Charge density is a measure of the charge stored per unit volume, 
and it is specified in £ , or related units. While capacity depends 
on the amount of material present, specific capacity and charge density do 
not. All of these measures may be specified as theoretical values calcu¬ 
lated from knowledge of the chemical reactions involved or practical values 
measured experimentally where the theoretical values are always slightly 
higher. Also for all of these values, only valence electrons are considered. 
Batteries and fuel cells necessarily have more electrons than are included 
in these measures because inner shell electrons, which do not participate in 
the chemical reaction, are ignored. Energy is stored in the bonds holding 
inner shell electrons, but this energy is not converted to electricity in bat- 
teries or fuel cells. The concept of charge density, pch in units was first 
introduced in section 1.6.1, and it shows up in Gauss’s law, one of Maxwell’s 
equations. However, the word capacity has nothing to do with the word 
capacitance introduced earlier. See Appendix C for more information on 
this and other overloaded terms. 

Theoretical specific energy is measured in g , or related units 

[128, ch. 1]. It is a measure of the energy stored in a battery or fuel 
cell per unit weight. It is the product of the theoretical cell voltage and 
the specific charge. Belatedly, theoretical energy density, measured in 

or is a measure of the energy stored in a device per unit volume. 

Theoretical energy density is the product of theoretical cell voltage and 
charge density. These measures can be calculated from knowledge of the 
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chemical reactions involved using information found in the periodic table. 
Practical specific energy and practical energy density are typically 25-35% 
below the theoretical values [128, ch. 1.5]. Specific energy and energy 
density are important measures of a battery. Often, high values are desired 
so that small and fight batteries can be used to power devices for as long 
as possible. However, as specific energy and energy density increase, safety 
considerations increase. 

Chemists sometimes define the charge in a mol of electrons as the Fara¬ 
day constant. It has the value 


6.022 ■ 10^3 atoms 1 e" 1.602 ■ lO-^^ C 
1 mol atom e~ 


9.649 ■ 10^ 


C 

mol 


(9.25) 


[68]. This quantity will not be used below because the Avogadro constant 
Na and the magnitude of the charge of an electron q are already specified 
and because this text already has too many variables. 

A h 

We can calculate the theoretical specific capacity in and the the¬ 
oretical specific energy in g for the reactions given by Eq. 9.19 and 9.20. 
The redox potential for the Mg half reaction is Wp = 2.68 V, and the redox 
potential for the Ni half reaction is Wp = 0.49 V [140] [137]. The overall 
cell voltage is 

Wezz = 2.68 + 0.49 = 3.17 V. (9.26) 


The reaction occurs spontaneously when it is set up because Wezz > 0. 

By unit conversions, we can calculate the weight per unit charge for 
each half reaction. From the periodic table, the atomic weight of Mg is 
24.31 the atomic weight of Ni is 58.69 and the atomic weight 

of O is 16.00 First consider the Mg half reaction of Eq. 9.19 which 
involves two v^ence electrons. 


24.31 ^ 
mol 

= 0.454 


1 mol 

6.022-1023 atoms 


1 atom 
valence e- 


1 e- . 1 C 

1.602-10-19 C 1 A-S 


3600 S 

1 h 


AT 


(9.27) 


Next, consider the Ni half reaction of Eq. 9.20 which also involves two 
valence electrons. The weight of Ni02 is 90.69 


90.69 


mq 


= 1.69 


AT 


1 mol 

6.022-1023 atoms 


1 atom 
2 valence e- 


Eor the overall reaction. 


1 e- _ 1 C . 3600 S 

1.602-10-19 C 1 A-S 1 h 

(9.28) 


g 

AT' 


0.454 + 1.692 = 2.146 


(9.29) 
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The overall theoretical specific capacity is the inverse of this quantity. 


1 

2.146 


0.466 


A-h 


(9.30) 


Adding charge densities for each half reaction does not make sense, but we 
can sum the terms for weight per unit charge in unit 

We can calculate the theoretical specific energy by multiplying the the¬ 
oretical cell voltage and the theoretical specific capacity. 


3.17 V-0.466 


A-h 

g 


1.48 


W-h 

g 


(9,31) 


The theoretical specific energy can be converted to the units 


J 

g' 


^ W-h 1 J 3600 s 

1.48-- 

g 1 W-s 1 h 


o J 

5.32 - 10^ - 
g 


(9.32) 


In the calculation above, only the electrode weights were considered. How¬ 
ever, the package, separator, and other battery components all have some 
mass which contribute to the weight of the battery. 


9.4.2 Practical Voltage and Efficiency 

We can model both a battery and a fuel cell as an ideal voltage source. This 
is a useful model, but at times, it is not good enough for multiple reasons. 
A better model includes some internal resistance [128, p. 9.27]. However, 
even this model is inadequate because the voltage of any practical battery 
depends on temperature, the load, the current through the battery, the 
fraction of capacity used, the number of times it has been recharged, and 
other factors [128, p. 3.2]. An even better model includes these variations 
too, as shown in Eig. 9.6. 


Simple battery 
model 


V 


A more accurate 
battery model 


(^n(t,T,i) 


Eigure 9.6: Models of a battery. 

There are many measures used to describe the voltage across a battery 
or fuel cell. The nominal voltage is the typical voltage during use, and it 
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is often the voltage printed on the label. The end or cutojf voltage is the 
voltage at the end of the battery’s useful life. The open eireuit voltage is 
the voltage under no load, and it is approximately the initial voltage of the 
battery. The elosed eireuit voltage is the voltage under load. It is less than 
the open circuit voltage due to the internal resistance of the battery [128, 
p. 3.2]. 

All batteries and fuel cells have some internal resistance. The cathode 
and anode are made of metals which are good, but imperfect, conductors. 
For example, carbon is a common electrode material, and it has an electrical 
conductivity between 1.6 ■ 10^ and 2.0 ■ 10^ [106]. Anytime current 

flows through a physical material with flnite electrical conductivity, energy 
is converted to heat. Actual voltage is a function of current drawn from 
the battery because at high currents, this effect is larger. Also, the actual 
voltage is a function of temperature because ions move faster at higher 
temperatures, so there is less internal resistance at higher temperatures 
[128, p. 3.9]. However, at higher temperatures, chemical reactions may 
occur more quickly, so the life of the batteries may be less because reactions 
occur faster. 

The actual voltage across a battery or fuel cell is also influenced by the 
accumulation of chemical reaction products. In the example given by Eqs. 
9.19 and 9.20, the reactants were Mg and Ni02 and the reaction products 
were Mg(OH )2 and Ni(OH) 2 . The actual voltage across the device decays 
with use because reactants build up in the electrolyte as the reaction occurs. 
These reactants inhibit further reactions from taking place [128, p. 3.2]. 

The effect of the accumulation of products on the voltage of a battery 
can be modeled by 




cell theor 


Vr 


cell prac 


ksT j / [products] \ 


N^q 


. (9.33) 

\ [reactants] / 


which is known as the Nernst equation [12, p. 750,789]. Many authors re¬ 
place the Boltzmann constant in this expression using M = Naks where Na 
is the Avogadro constant and M is the molar gas constant. In this expres¬ 
sion, Vceii theor IS the theoretical cell voltage, and Vceii prac is the practical 
cell voltage that incorporates the effect of reaction products. The quantity 
Ny represents the number of valence electrons involved in the chemical re¬ 
action. For the example of Eqs. 9.19 and 9.20, two electrons are involved. 
So, the quantity represents the internal energy per valence electron 

involved in the reaction. The quantity Ireartants ] 
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quotient, and its natural log is between zero and one. 


0 < In 


/ [products] \ 
\ [reactants] J 


< 1 


(9.34) 


When a battery is first set up, there are many reactants but few products 
present, and 


In 


/ [products] \ 
\ [reactants] / 


0 . 


(9.35) 


In this case, the activity quotient is very small, so the practical cell voltage 
between the terminals is very close to the theoretical cell voltage. After a 
battery has been discharging for a long time, the activity quotient is large 
because many products are present. 


, / [products] \ 

V[reactants]; ~ 


(9.36) 


As expected, this model shows that as a battery discharges, the difference 
between the theoretical and practical cell voltage grows. We cannot ever 
use the entire capacity stored in a battery. As the battery discharges, the 
voltage between the terminals drops. At some point, the voltage level is 
too low to be useful, and the end voltage is reached. At this point, the 
battery should be replaced even though it still has some stored charge. 

The Nernst equation is useful to chemists because it can be used to 
solve for the amount concentration of reaction products and reactants. 
The theoretical cell voltage can be calculated or found in a table, and the 
practical cell voltage can be measured with a voltmeter. Reference [137] 
tabulates components of the activity quotient as a function of temperature 
for various reactions. 

Electrical engineers may be more interested in the Nernst equation be¬ 
cause it gives information on the efficiency of batteries and fuel cells. Effi¬ 
ciency is defined as the output power over the input power or the output 
energy over the input energy. 


Veff 


E. 


out 


Ein 


(9.37) 


Energy stored in an electrical component is given by Eq. 2.8 where Q is 
charge and V is voltage. The amount of charge involved in each reaction is 
given by number of electrons involved times their charge for each, Q = qNy. 


Ein ^qElyVyyii theor 


( 9 . 38 ) 
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Internal energy of a reaction at temperature T is also given by 


F'in 


1 

2 


ksT. 


(9.39) 


We can model the theoretical voltage of a battery cell by combining Eqs. 
9.38 and 9.39. 

ksT = qNyVcell theor (9.40) 

/ n~' 

Vcell theor = “(9.41) 
qNv 

The output energy produced by the battery is proportional to the prac¬ 
tical cell voltage measured between the terminals. 


Eout - ^qN.Vcell prac 


The efficiency can then be rewritten. 


Veff = 


^cell 


prac 


Vcell 


theor 


(9.42) 


(9.43) 


With some algebra, we can use the Nernst equation to write this quantity 
as a function of the activity quotient. 


Veff = 


Vr 


cell prac 


Vr 


cell theor 


-V 


cell theor 


Veff = 1 


^cell theor 
^cell theor - Vcell 


prac 


Vcell te 

The numerator can be replaced using the Nernst equation. 

1 


Veff = 1 


ksT j / [products] \ 
Vcell theor \^kVyq y [rcactauts] y 


(9.44) 

(9.45) 


(9.46) 


Equation 9.47 shows that the efficiency is a function of the activity quotient. 
As described above, the activity quotient is different for different reactions, 
and it varies with temperature. The activity quotient is a measure of the 
effect of the accumulation of products in the electrolyte of a battery or fuel 
cell. 

Equation 9.47 describes the efficiency of batteries and fuel cells. It is 
another way of expressing the Nernst equation. It is analogous to equa¬ 
tions we have encountered describing efficiency of other energy conversion 
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devices. More specifically, it has a similar form to the equation for the 
Carnot efficiency, Eq. 8.42. Carnot efficiency describes the temperature 
dependence of the efficiency of all devices which convert a temperature 
difference to another form of energy. It was introduced in the context of 
thermoelectric devices, but it applies to pyroelectric devices, steam tur¬ 
bines, and other devices too. These equations also have a similar form 
to Eq. 7.23 which modeled the effect of mirror refiectivity and optical 
absorption on the efficiency of a laser. 

9.5 Battery Types 

9.5.1 Battery Variety 

An ideal battery has many desirable qualities. It should: 

• have high specific energy and energy density 

• contain no toxic chemicals so that it is environmentally friendly and 
easy to dispose of safely 

• be safe to use 

• be inexpensive 

• be rechargeable 

• require no complicated procedure to recharge 

• be able to output large current 

• be able to withstand a wide range of temperatures 

• produce a constant voltage output throughout its life (have a fiat 
discharge curve) 

• remain charged for a long time while in storage 

The list above is not complete, and it is in no particular order. Tradeoffs 
are needed because many of these qualities inherently contradict. Eor ex¬ 
ample, a device with a high specific energy necessarily requires more safety 
precautions and controlled use than a device with low specific energy. 

Batteries are used in a wide range of applications, so one type is not best 
in all situations. As an example, a car ignition battery must be recharge¬ 
able, have high capacity, output large current, and operate over a wide 
temperature range. However, car batteries do not require particularly high 
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specific energies. As another example, tiny batteries are used to power 
microelectromechanical systems such as micropumps [142] [143]. These 
batteries must have high specific energy and be able to be produced in 
small packages. Some are even built into integrated circuits [144] [145]. 

One way to classify batteries is as primary or secondary. A primary 
battery is used once, then disposed. A secondary battery is a rechargeable 
battery. Primary batteries have the advantage of simplicity [128, ch. 8]. 
They do not require maintenance, so they are simple to use. Also, their 
construction may be simpler than secondary batteries because they do not 
need additional circuitry built in to monitor or control the recharging pro¬ 
cess. They often have high specific energy too [128, ch. 8]. They come 
in a variety of sizes and shapes, and they are made with a variety of elec¬ 
trode and electrolyte materials. Many alkaline and lithium ion batteries 
are designed to be primary batteries. Secondary batteries have the obvi¬ 
ous advantage of not producing as much waste that ends up in a landfill. 
Also, the user does not need to continually purchase replacements. While 
secondary batteries may cost more initially, they can be cheaper in long 
run. They are often designed to be recharged thousands of times [128, ch. 
15]. Many secondary batteries have a very fiat discharge curve, so they 
produce a constant voltage throughout use, even upon multiple charging 
cycles [128, ch. 15]. Two of the most common types of secondary batteries 
are lead acid batteries and lithium batteries. 

There are many battery types, distinguished by choice of electrolyte 
and electrodes. Four common battery types are discussed in this section: 
lead acid, alkaline, nickel metal hydride, and lithium. Not all batteries 
fit into one of these families. Some devices, like zinc air batteries, are 
even harder to categorize. Zinc air batteries are actually battery fuel cell 
hybrids because the zinc of the anode is consumed as in battery operation 
while oxygen from air is consumed as in fuel cell operation. However, by 
considering these four classes, we will see some of the variety available. For 
a more thorough and encyclopedic discussion of battery types, see reference 
| 128 |. 

Table 9.2 summarizes example batteries of each of these four types. The 
first three rows list example materials used to make the anode, cathode, 
and electrolyte for batteries. Materials listed in the table are just examples, 
so batteries of each type can be made with a variety of other materials too. 
The next two rows give approximate values for the specific energy in units of 
All values are approximate values for representative devices provided 
to give an approximate value for comparison, not necessarily values for a 
particular device. The fifth row lists example values for the theoretical 
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Lead acid 

Alkaline 

Lithium 

Nickel 

Metal 

Hydride 

Example 

anode 

material 

Pb 

Zn 

Li 

LaNis 

Example 

cathode 

material 

Pb02 

Mn02 

CP or 

Mn02 

NiOOH 

Example 

electrolyte 

H 2 SO 4 

KOH or 
NaOH 

Organic 
solvents 
and LiBP 4 

KOH 

Example 

applications 

Car 

ignitions 

Toys 

Cellphones, 

Medical 

devices 

Power tools 

Theoretical 

specific 

Wh 

energy, 

252 

358 

448 

240 

Practical 

specific 

Wh 

energy, 

35 

154 

200 

100 

References 

[128, p. 

15.11] [140] 

[128, p. 

8.10] [140] 

[128, p. 

15.1, p. 

31.5] 

[128, p. 

15.1] [146] 


Table 9.2: Example material components and specific energy values for 
batteries based on different chemistries. 


specific energy of the chemical reaction involved while the sixth row lists 
example specific energy values for practical devices which are necessarily 
lower than the theoretical values. The specific energy values in the table 
can be compared to specific energy of various other materials or energy 
conversion devices listed in Appendix D. 


9.5.2 Lead Acid 

Lead acid batteries are secondary batteries which typically have an anode 
of Pb and a cathode of Pb 02 [128, ch. 15]. The electrolyte is a liquid 
solution of the acid H 2 SO 4 which ionizes into 2H’'' and 804 “. The reaction 
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at the anode is 

Pb+SO^ ^ PbS04 + 2e- (9.48) 

with a redox potential of Vrp = 0.37 V [140]. The reaction at the cathode 
is 

PbOs + SO^ + 4H+ + 2e- ^ PbS04 + 2 H 2 O (9.49) 

with a redox potential of Vrp = 1.685 V [140]. The overall cell voltage is 
Vceii = 2.055 V, so in a car battery, six cells are packaged in series. 

Lead acid batteries have a long history. The development of the battery 
dates to the work of Volta around 1795 [3, p. 2], and practical lead acid 
batteries were first developed around 1860 by Raymond Gaston Plante 
[128, p. 16.1.1]. Today, lead acid batteries are used to start the ignition 
system in cars and trucks, used as stationary backup power systems, and 
used in other applications requiring large capacity and large output current. 
Typically, lead acid batteries can handle relatively high current, and they 
operate well over a wide temperature range [128, p. 15.2]. Additionally, 
they have a fiat discharge curve [128, p. 15.2]. Other types of batteries have 
a higher energy density and specific energy, so lead acid batteries are used 
in situations where specific energy is less of a concern than other factors. 

9.5.3 Alkaline 

Alkaline batteries typically have a zinc anode and a manganese dioxide 
Mn02 cathode [128, p. 8.10]. Figure 9.7 shows naturally occurring man¬ 
ganese dioxide (the dark mineral) on feldspar (the white mineral) from 
Ruggles mine near Grafton, New Hampshire. The batteries are called al¬ 
kaline due to the use of an alkaline electrolyte, typically a liquid potassium 
hydroxide KOH solution [128, p. 8.10]. Most alkaline batteries are primary 
batteries, but some secondary alkaline batteries are available. Alkaline bat¬ 
teries have many nice properties. They can handle high current outputs, 
they are inexpensive, and they operate well over a wide temperature range 
[128, p. 8.10]. One limitation, though, is that they have a sloping discharge 
curve [128, p. 8.10]. Alkaline batteries were originally developed for mil¬ 
itary applications during WWII [128, ch. 8]. They became commercially 
available in 1959, and they became popular in the 1980s with improvements 
in their quality [128, p.11.1]. They are commonly used today in inexpensive 
electronics, toys, and gadgets. 
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Eigure 9.7: Naturally occurring manganese dioxide (the dark mineral) on 
feldspar (the white mineral). 

9.5.4 Nickel Metal Hydride 

Nickel metal hydride batteries have an anode made from a nickel metal alloy 
saturated with hydrogen. One example alloy used is LaNis [146]. Another 
rare earth atom may replace the lanthanum [146], and other alloys like 
TiNi 2 or ZrNi 2 saturated with hydrogen are also used as anode materials 
[146]. The cathode is typically made from a nickel oxide, and the electrolyte 
is potassium hydroxide, KOH [128, p. 15.11]. The reaction at the anode is 
[146] 

Alloy(H) + OH" ^ Alloy + H 2 O + e" (9.50) 

and the reaction at the cathode is [146] 

Ni00H + H20 + e-^ Ni(OH)2 + OH". (9.51) 

This cathode reaction has a redox potential of Kp = 0.52 V [137]. 

Nickel metal hydride batteries have many advantages. They have a flat 
discharge curve. They are secondary batteries which can be charged reli¬ 
ably many times [128, p. 15.1] [147]. Additionally, they are better for the 
environment than the related nickel cadmium batteries, so there are less 
constraints on how they can be safely disposed [147]. However, they do 
not have as high of energy density as lithium batteries [147]. Nickel metal 
hydride batteries were first developed in the 1960s for satellite applications, 
and research into them accelerated in the 1970s and 1980s. At the time, 
they were used in early laptops and cellphones, but lithium batteries are 
used in these applications today [128, p. 22.1]. They are found now in 
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Figure 9.8: The illustration shows a nickel-hydrogen battery and orbital 
replacement unit which powers the International Space Station. This figure 
is used with permission from [148]. 


some portable tools, in some cameras, and in some electronics requiring 
repeated recharging cycles or requiring high current output. The Interna¬ 
tional Space Station is powered by 48 orbital replacement units, and each 
orbital replacement unit contains 38 nickel-hydrogen battery cells. Figure 
9.8 illustrates an orbital replacement unit [148]. 

9.5.5 Lithium 

Lithium has a high specific energy, so it is very reactive and a good choice 
for battery research. For this reason, many different battery chemistries 
utilizing lithium have been developed. The anode may be made out of 
lithium or carbon [128, ch. 8,15]. Possible cathode materials include Mn02, 
LiCo02, and FeS 2 [128, ch. 8,15]. Electrolytes may be liquid or solid. A 
possible electrolyte is the mixture of an organic solvent such as propylene 
carbonate and dimethoxyethane mixed with lithium salts such as LiBF 4 or 
LiC 104 [128, p. 31.5]. Figure 9.9 shows lepidolite, a lithium containing ore 
of composition K(Li,Al) 2 _ 3 (AlSi 30 io)( 0 ,OH,F) 2 , from Ruggles mine near 
Grafton, New Hampshire. 

Lithium batteries have been in development since the 1960s, and they 
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were used in the 1970s in military applications [128, p. 14.1], Both primary 
and secondary lithium batteries are available today. They are popular due 
to their high specific energy and energy density. They are used in many 
consumer goods including cellphones, laptops, portable electronics, hearing 
aids, and other medical devices [149]. Many lithium batteries are designed 
to output relatively low current to prevent damage, and secondary lithium 
batteries require controlled recharging to prevent damage [128, ch. 15]. 
Even with these limitations, over 250 million cells are produced each month 
[128, ch. 15]. 



Eigure 9.9: Naturally occurring lepidolite, an ore of lithium. 


9.6 Fuel Cells 

9.6.1 Components of Fuel Cells and Fuel Cell Systems 

A fuel cell is a device which converts chemical energy to electrical energy 
through the oxidation of a fuel. Like batteries, all fuel cells contain an 
anode from which electrons and ions flow away, a cathode from which elec¬ 
trons and ions flow towards, and an electrolyte. The electrodes are typically 
porous which makes it easier for the fuel and oxidizer to get to the reac¬ 
tion site, provides more surface area for the reaction to occur, allows for a 
higher current through the electrode, and allows for less catalyst to be used 
[60, ch. 5]. The electrolyte may be a liquid or a solid. Examples of liquid 
electrolytes include potassium hydroxide solution and phosphoric acid so¬ 
lution [128]. Examples of solid electrolytes include (Zr02)o.85(CaO)Q ^5 and 
(Zr 02 )o. 9 (Y 203 )o.i [60]. Also like a battery, individual cells may be stacked 
together in a package. A single fuel cell may have a cell voltage of a few 
volts, but multiple cells may be packaged together in series to produce tens 
or hundreds of volts from the unit. 
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Fuel cell components 

Anode 

Cathode 

Electrolyte 

Membrane 

Catalyst 

Fuel 

Oxidizer 


Fuel cell system components 

Fuel processor 

Flow plates 

Heat recovery system 

Inverter 

Other electronics 


Table 9.3: Fuel cell components. 


In addition to these components, fuel cells often contain a thin polymer 
membrane, and fuel cell electrodes are often coated with a catalyst which 
speeds up the chemical reaction. An example material used to make the 
membrane is a 0.076 cm layer of polystyrene [60, ch. 10]. Another example 
membrane is polybenzimidazole containing phosphoric acid [128, ch. 37]. 
Membranes allow ions, but not the fuel and oxidizer to pass through [60, 
ch. 10]. In addition to selectively allowing ions to pass through, membranes 
should be chemically stable to not break down in the presence of the often 
acidic or alkaline electrolyte, should be electrical insulators, and should be 
mechanically stable [60, ch. 10]. A useful catalyst speeds up the reaction at 
the electrodes. In addition, a good catalyst must not dissolve or oxidize in 
the presence of the electrolyte, fuel, and oxidizer [60, ch. 8]. Additionally, 
it should only catalyze the desired reaction, not other reactions [60, ch. 
8]. Examples of catalysts used include platinum, nickel, acetylacetone, and 
sodium tungsten bronze NUajWOs with 0.2 < a; < 0.93, [60, ch. 6]. 

During operation, the fuel and oxidizer are continuously supplied to 
the device. Euel may be in the form of a gas such as hydrogen or carbon 
monoxide gas, it may be in the form of a liquid such as methanol or am¬ 
monia, or it may be in the form of a solid such as coal [60, ch. 10]. Oxygen 
gas or air which contains oxygen is typically used as the oxidizer [60, ch. 
10 |. 

Additional chemical, mechanical, thermal, and electrical components 
are often included in an entire fuel cell system. Some fuel cell systems 
include a fuel processor which breaks down the fuel to convert it to a usable 
form and which filters out impurities [141]. Eor example, a fuel processor 
may take in coal and produce smaller hydrocarbons which are used as fuel. 
Also, fuel cells system may contain flow plates which channel the fuel and 
oxidizer to the electrodes and channel away the waste products and heat 
[141]. Some fuel cells include heat recovery systems, built in thermoelectric 
devices which convert some of the heat generated back to electricity. Eor 
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systems intended to be connected to the electrical grid, inverters which 
convert the DC power from the fuel cell to AC are included. A fuel cell 
system also typically include a control system that regulates the flow of the 
fuel and oxidizer, monitors the temperature of the device, and manages its 
overall operation [128, ch. 37]. 


9.6.2 Types and Examples 

Enel cells may be classified in different ways. One way is by operating 
temperature: low 25-100 °C, medium 100-500 °C, high 500-1000 °C, and 
very high over 1000 °C [60, ch. 1]. Chemical reactions typically occur more 
quickly at higher temperatures. However, one challenge of designing high 
temperature fuel cells is that materials must be selected that can withstand 
the high temperatures without melting or corroding [60, ch. 2]. 

As with batteries, another way to classify fuel cells is as primary or 
secondary [60, ch. 1]. In a primary fuel cell, also called nonregenerative, 
the reactants are used once then discarded. In secondary fuel cells, also 
called regenerative, the reactants are used repeatedly. An external source 
of energy is needed to refresh the fuel for reuse, and this source may supply 
energy electrically, thermally, photochemically, or radiochemically [60, p. 
515]. Both primary and secondary fuel cells have been made with a variety 
of organic and inorganic fuels [60, p. 515]. 

Another way to classify fuel cells is as direct or indirect [60, ch. 1,7] 
[128, ch. 37]. In a direct fuel cell, the fuel is used as is. In an indirect fuel 
cell, the fuel is processed first inside the system. Eor example, an indirect 
fuel cell may take in coal and use an enzyme to break it down into smaller 
hydrocarbons before the reaction of the cell [60, ch. 7]. 

Eamilies of fuel cells are often distinguished by the type of electrolyte 
used. Examples include alkaline which use a potassium hydroxide solution 
as the electrolyte, phosphoric acid, molten carbonate, and solid oxide which 
use solid ceramic electrolytes. Other times, fuel cells are categorized by the 
type of membrane or the type of fuel used. Two of the most common types 
of fuel cells are proton exchange membrane fuel cells and direct methanol 
fuel cells [128, ch. 37]. Proton exchange membrane fuel cells use hydrogen 
gas as the fuel, oxygen from air as the oxidizer, a solid electrolyte, and 
a platinum catalyst [128] [141]. They operate at low temperature and 
are used in buses, aerospace applications, and for backup power. Direct 
methanol fuel cells use methanol as a fuel. They also often operate at low 
or medium temperatures [128] and are used for similar applications. 



232 


9.6 Fuel Cells 


9.6.3 Practical Considerations of Fuel Cells 

The history of fuel cells goes back almost as long as the history of batteries. 
The concept of the fuel cell dates to around 1802 [3, p. 2,222] [60, p. v]. 
Working fuel cells were demonstrated in the 1830s [3, p. 222] [60, p. v], 
and the first practical device was built in 1959 as pure materials became 
commercially available [5, p. 46] [60, p. v, 26]. While both batteries and 
fuel cells are commercially available, batteries have found a home inside 
almost every every car, computer, and electronic devices while fuel cells 
are more specialized products. There are a number of limitations of fuel 
cell technology that have prevented more widespread use. One limitation 
is their cost. Some fuel cells use platinum as the catalyst, and platinum is 
not cheap. Some cells that do not use platinum catalysts have the problem 
that their efficiency is reduced in the presence of carbon monoxide or carbon 
dioxide, which are commonly found in air. Hydrogen gas or methane are 
used as the fuel in some cells, and the delivery and storage of these fuels 
pose challenges. Additionally, some of the more efficient systems are large 
and require fixed space, air or water cooling, and additional infrastructure, 
so these devices do not lend themselves to portable applications. 

Fuel cells have advantages which lead to useful applications. Many fuel 
cells produce no harmful outputs. If hydrogen gas is used as the fuel and 
oxygen from the air is used as the oxidizer, the only byproduct is pure water. 
It is hard to find an energy conversion device which generates electricity 
and is easier on the environment than this type of fuel cell. The left part 
of Fig. 9.10 shows a photograph of a proton exchange membrane fuel cell. 
The right part of Fig.9.10 shows an image of the water formed during its op¬ 
eration. The image was obtained by the neutron radiography method, and 
it was taken at the National Institute of Standards and Technology Center 
for Neutron Research in Gaithersburg, Maryland. These figures are used 
with permission from [150]. In some applications, the water production is a 
main advantage. NASA space vehicles have used fuel cells to produce both 
electricity and pure water since the Gemini and Apollo projects dating to 
the 1960s [3, p. 250]. They have been used to produce both electricity 
and water on military submarines since the 1960s too [3, p. 250]. Another 
advantage of fuel cells is that they can be more efficient than other devices 
which generate electricity. High temperature and higher power units can 
have efficiencies up to 65% [128]. Since some of the highest efficiencies are 
achieved in higher temperature and higher power devices, fuel cells have 
found a niche in large and stationary applications generating kilowatts or 
megawatts of electricity. 
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Eigure 9.10: The picture on the left shows a proton exchange membrane 
fuel cell. The figure on the right is an image of the water formed in it 
during operation. The image was obtained by the neutron radiography 
method at the NIST Center for Neutron Research. These figures are used 
with permission [150]. 
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9.7 Problems 


9.1. A 50 liter solution contains 8 ■ 10^® ions. Calculate the pH. Is this 
solution acidic or basic? 

9.2. A bottle contains 3 liters of a chemical solution with a pH of 8. 

(a) Does the bottle contain an acid or a base? 

(b) Approximately how many H’*' ions are in the bottle? 

(c) Would a 3 liter bottle with a pH of 9 contain more or less ions 
of H^ than the bottle with a pH of 8? 

(d) How many times as many/few H^ ions are in the bottle with 
solution of pH 8 than in the bottle with solution of pH 9? 

9.3. Consider a battery with a lithium electrode and a silver chloride 
(AgCl) electrode. Assume the following chemical reactions occur in 
the battery, and the redox potential for each reaction is shown. 

AgCl + e“ —)■ Ag + Cl~ Vrp = 0.22 V 
Li —> Li+ + e~ Vrp = 3.04 V 

(a) Which reaction is likely to occur at the cathode, and which 
reaction is likely to occur at the anode? Justify your answer. 

(b) What is the overall theoretical cell voltage? 

(c) If the battery is connected to a 1 kD load, approximately what 
is the power delivered to that load? 

9.4. Suppose the chemical reactions and corresponding redox potentials 
in a battery are given by [137]: 

Li —)■ Li"*" + e~ Vrp = 3.04 V 
S+2e- ^ S^- Vrp = -0.57 V 

(a) Find the overall theoretical specific capacity of the battery in 
C 

g- 

(b) Find the overall theoretical specific energy of the battery in g. 

(c) Which material, lithium or sulfur, gets oxidized, and which ma¬ 
terial gets reduced? 
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9.5. A battery has specific capacity 252 ^ and mass of 50 g. Its overall 
density is 2.245 

(a) Eind the specific capacity in 

(b) Eind the capacity in mA-h. 

(c) Eind the charge density in 

9.6. A battery has a specific capacity of 55 and a nominal voltage 

of 2.4 V. The battery has a mass of 165 g. Eind the energy stored in 
the battery in J. 

9.7. A battery has a sulfur cathode where the reaction S + 2e“ —)■ S^“ 
occurs. The anode is made from a mystery material, X, and at the 

anode, the reaction X —)■ + 2e“ occurs. The theoretical specific 

A h 

capacity of the sulfur reaction is 1.76 and the theoretical specific 

A h 

capacity of material X is 0.819 . The theoretical specific capacity 

A h 

of the materials combined is 0.559 What is material X, and what 
is Vrp, the redox potential of the battery? 

(Hint: Use a periodic table and a fist of redox potentials.) 

9.8. What is the difference between each of the items in the pairs below? 

• A battery and a fuel cell 

• A primary battery and a secondary battery 

• Redox potential and chemical potential 

9.9. Consider the polymer electrolyte membrane fuel cell shown below. 
The reactions at the electrodes are: 

Ha 2H+ + 2e“ 

4e- + 4H+ + 02^ 2 H 2 O 

Match the label in the picture to the component name. 
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\ 

c 

Component name 

p 

Label 

1. Anode 


2. Cathode 


3. Electrolyte 


4. Load 


5. Polymer electrolyte membrane 



9.10. Match the name of the fuel cell components to a material used to 
make that component. 


Fuel cell component Material 

name 


1. Anode 


A. Platinum 

2. Byproduct 


B. Carbon 

(waste produced) 


(solid, but porous) 

3. Catalyst 


C. Water 

4. Electrolyte 


D, Sulfuric Acid 

5. Enel 


E. liquid Hydrogen 
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10 Miscellaneous Energy Conversion Devices 

10.1 Introduction 

This text is limited to discussing energy conversion devices that involve 
relatively low powers and that involve electrical energy. Eurthermore, this 
text excludes energy conversion devices involving magnets and coils. Even 
within these limitations, a wide variety of energy conversion devices have 
been discussed. This chapter briefly mentions a few additional devices that 
meet these criteria. Many more devices exist, and with continued creativity 
and ingenuity by scientists and engineers, more will be developed in the 
future. 


10.2 Thermionic Devices 

Thermionic devices convert thermal energy to electricity using the thermionic 
effect [3, p. 182]. A thermionic device consists of a vacuum tube with elec¬ 
trodes in it. The metal cathode is heated until electrons start evaporating 
off the metal. The electrons collect at the anode which is at a colder tem¬ 
perature. In a typical device, the cathode may be at a temperature of 1500 
°C, and the distance between the anode and the cathode may be 10 /rm [60]. 

A device based on this effect was first patented by Thomas Edison in 1883. 
The Carnot efficiency limits this effect because a temperature differential 
is converted to electricity [5]. Efficiencies up to 12% have been measured. 
However, for a given temperature differential, other methods of convert¬ 
ing temperature difference to electricity are often more efficient. Cathodes 
have been made from tungsten, molybdenum, tantalum, and barium ox¬ 
ide [3]. The cathode gets used up in the process and eventually needs to 
be replaced. Anodes have been made from copper, cesium, nickel, barium 
oxide, strontium oxide, and silver [3] [60]. Some gas chromatographs use 
nitrogen phosphorous thermionic detectors [151]. 

10.3 Radiation Detectors 

Radiation detectors convert energy from radioactive sources to electricity. 
Excessive radiation can be harmful to people, and humans cannot sense 
radioactivity. We can only measure it indirectly. Eor these reasons, ra¬ 
diation detectors are used as safety devices. Radiation can be classified 
as alpha particles, beta particles, gamma rays, or neutrons [37, p. 404]. 
Alpha particles are positively charged radiation composed of ionized nu¬ 
clei of helium. Beta particles are high energy electrons. Gamma rays are 
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high energy, short wavelength electromagnetic radiation. When these three 
types of radiation interact with air or another gas, they can excite or ionize 
the atoms of the gas. Flowing ions are a current, so this process converts 
the radiation to electricity. Types of radiation detectors include ionization 
chambers, Geiger counters, scintillation counters, and photographic film 
based detectors [37]. 

Ionization chambers and Geiger counters work on the same principle. 
In both cases, a gas is enclosed in a chamber or tube, and a voltage is 
applied across the gas [37]. Incoming alpha particles, beta particles, or 
gamma rays, ionize the gas. Due to the applied voltage, positive ions 
flow to one of the electrodes, and negative ions flow to the other electrode 
thereby forming a current. Geiger counters operate at higher voltages than 
ionization chambers. The voltage between the electrodes in an ionization 
chamber may be from a few volts to hundreds of volts while the voltage is 
a Geiger counter is typically from 500 V to 2000 V [37]. Many smoke de¬ 
tectors are ionization chambers [152]. When no smoke is present, radiation 
from a weak radiation source ionizes air between the electrodes, and a cur¬ 
rent is detected on the electrodes. When smoke is present, it scatters the 
radiation, so no current is detected [152]. In an ionization chamber, each 
incoming radioactive particle causes a single atom to ionize. In a Geiger 
counter, an incoming radioactive particle causes an atom to ionize. Then, 
the ions formed ionize additional atoms of the gas, and these ions ionize 
additional atoms forming a cascading reaction powered and maintained by 
the voltage gradient which accelerates and separates the ion pairs. Geiger 
counters are often more sensitive due to this amplification of the current 
produced. 

Scintillation counters and photographic film based detectors involve an 
additional step in converting radiation to electricity. A scintillation counter 
is often made from a crystalline material such as sodium iodide [37]. Some¬ 
times a phosphor is also used [5, p. 166]. Incoming radioactive particles 
excite, but do not ionize, the atoms of the material. These atoms then 
decay and emit a photon. Semiconductor or other types of photodetectors 
convert the photons to electricity [37]. In photographic film based detectors, 
incoming radioactive particles expose the film thereby changing its color 
[37]. Materials used in the film include AI 2 O 3 and lithium fluoride [153]. 
Again, photodetectors are used to convert the information recorded on the 
film to a measurable signal. Scintillation counters can be higher sensitivity 
than other types of radiation detectors, and they can be used to determine 
the energy of incoming radiation by spectroscopy [154]. The film based 
detectors can be worn as a ring or badge. These type of detectors are used, 
for example, by radiology technicians and by nuclear power plant employ- 
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ees. These detectors must be sent in to a lab to be analyzed, and both the 
amount and the type of radiation can be determined [153]. 

10.4 Biological Energy Conversion 

The human body can be considered an energy conversion device. Humans 
take in chemical energy in the form of food and convert it to kinetic energy, 
heat, and other forms of energy. Some components of the human body are 
also energy conversion devices. Muscles convert chemical energy to kinetic 
energy. Photoreceptors in the retina of the eye convert optical energy of 
photons to electrical energy of neurons. The ear converts sound waves to 
energy stored in the pressure of the fluid of the inner ear, kinetic energy of 
moving hairs that line the cochlea of the inner ear, and electrical energy of 
neurons. The human body also stores energy. Muscles can store energy as 
they stretch and contract. Human fat cells store energy in chemical form. 
When you walk, your center of mass moves up and down storing energy in 
pendulum-like motion [155]. Additionally, bone, skin, and collagen exhibit 
piezoelectricity [156]. 

Neurons are nerve cells that convert chemical energy to electrical en¬ 
ergy. The human brain has around 10^^ neurons [157, p. 135]. They are 
composed of a cell body, an axon, dendrites, and synapses [158]. The axon 
is the flbrous part that transmits information to other neurons. The den¬ 
drites are the flbrous part that receives information from other neurons. A 
synapse is a gap between neurons. Ions, such as Na"*", K"*", or CU, build 
up on the membrane or in the gap between two neurons, and the charge 
separation of the ions causes an electrical potential [157]. Ions sometimes 
cross the gap between neurons. Neurons may be classifled as sensory af- 
ferents, interneurons, or motoneurons [157]. Sensory afferents transmit a 
signal from sensory receptors to the nervous system. Interneurons transmit 
the signal throughout the nervous system, and motoneurons transmit the 
signal from the nervous system to muscles [157]. Electrical signals trans¬ 
mitted along the nervous system involve pulses with a duration of a few 
milliseconds [157]. The information is encoded in the frequency rate of the 
pulses [157]. 

10.5 Resistive Sensors 

Sensors may be made from capacitive, inductive, or resistive materials. 
These sensors may involve direct energy conversion or may involve mul¬ 
tiple energy conversion processes. In Chapters 2 and 3 capacitive energy 
conversion devices were discussed. The capacitance C of a parallel plate 
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capacitor is given by 

( 10 , 1 ) 

^thick 


If the permittivity e, cross sectional area A, or separation of the plates 
dthick change with respect to any effect, we can make a capacitive sensor. 
Capacitive sensors are calibrated devices which involve energy conversion 
between electricity and material polarization. While most inductive energy 
conversion devices are outside the scope of this book, a few such devices 
were discussed in Chapters 4 and 5. The inductance L of a single turn 
inductor is given by 


L = 




w 


( 10 . 2 ) 


If the permeability /r, thickness dthick^ or width w change with respect to 
any effect, we can make an inductive sensor which utilizes energy conversion 
between electricity and magnetic energy. Similarly, the resistance i? of a 
uniform resistive device is given by 


R = 


pL 

A' 


(10.3) 


If the resistivity p, length /, or cross sectional area A change with respect 
to any effect, we can make a resistive sensor. When a current is applied 
through a resistive sensor, energy is converted from electricity to heat, and 
a resistive sensor is calibrated so that a given voltage drop corresponds to 
a known change in some parameter. 

Many resistive senors are available. A potentiometer is a variable re¬ 
sistor. As current flows through it, energy is converted from electricity 
to heat. When the knob of a potentiometer is turned, the length of the 
material through which the current flows is changed, so the rate of energy 
conversion through the device changes. A resistanee temperature deteetor 
converts a temperature difference to electricity [37, p. 88]. Resistance tem¬ 
perature detectors work based on the idea of the Thomson effect discussed 
in Section 8.6.1. In these devices, the resistivity varies with temperature. 
When a strain is applied to a resistive strain gauge, both the length and 
cross sectional area of the device change. Pirani hot wire gauges are used 
to measure pressure in low pressure environments [37, p. 97]. In a Pi¬ 
rani gauge, current is applied through a metallic filament, and the filament 
heats up. As air molecules hit the filament, heat is transferred away from 
it. The resistance of the filament depends on temperature, and the fila¬ 
ment cools more quickly in an environment with more air molecules than 
in an environment at a lower pressure. By monitoring the resistance of the 
filament, the pressure can be determined. 
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Eigure 10.1: A constricted pipe used to illustrate Bernoulli’s equation. 

10.6 Electrofluidics 

Electrohydrodynamic devices (EHDs) convert between electrical energy and 
fluid flow. These devices are also known as electrokinetic devices. Microflu¬ 
idic devices are EHD devices that are patterned on a single silicon wafer or 
other substrate, and length scales are often less than a millimeter [159]. En¬ 
gineers have built EHD pumps, valves, mixers, separators, and other EHD 
devices [159] [160]. Electrohydrodynamic or microfluidic devices have been 
used in products including ink jet printers, chemical detectors, machines 
for DNA sequencing or protein analysis, and insulin pumps [61] [160] [161]. 

Some EHD devices operate based on the idea of Bernoulli’s equation, 
and this relationship is a direct consequence of energy conservation. To 
illustrate the fundamental physics of this idea, begin by considering a sim¬ 
pler device, a constricted pipe. This pipe converts energy from a pressure 
differential to kinetic energy [103, ch. 3] [162, p. 346]. Consider a fluid with 
zero viscosity and zero thermal conductivity flowing through a horizontal 
pipe (so gravity can be ignored). Eigure 10.1 illustrates this geometry. 
The velocity and pressure P are different at locations with different pipe 
diameter, for example locations 1 and 2 in the figure. Consider a small 
amount of water with mass m = pdens^'^ where pdens is density and AV 
is the small volume. Assume there are two, and only two, components 
of energy: kinetic energy and energy due to the compressed fluid. In go¬ 
ing from location 1 to location 2, the pressure of this little mass of fluid 
changes. Change in energy due to compressing this drop of water is equal 
to (Pi — P 2 ) AV. The kinetic energy also changes, and change in kinetic 
energy is given by 

1 |^|2 _ 1 |^|2 ^ 1 (p,^„,AV) (|EtP - . (10.4) 
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However, energy is conserved, so 

(Pi - P2) AV + i (prfensAV) (|EtP - = 0. (10.5) 

This expression can be simplifled algebraically. 

Pi - P 2 + ]^Pdens\^l? - ]^Pdens\^? = 0 (10.6) 

Both pressure P and velocity are functions of location. The only way 
this expression can be true for all locations is if it is true for each location 
and a constant. 

1 1 

Pi + -pdens\^i? = P2 + -pdens\^? = constant (10.7) 

Bernoulli’s equation with the rather severe assumptions above becomes 

P + ]j^Pdens\~^? = constant. (10.8) 

Bernoulli’s equation is also used to describe the lift of an air foil or the path 
of a curve ball in baseball [162, p. 350]. In some EHDs electricity induces 
changes in the pressure or volume of a microfluidic channel. The fluid in 
these devices may be conductive or insulating. As seen by Eq. 10.8, this 
change in pressure induces a change in fluid velocity. 

In other EHDs, applied voltages exert forces on conductive fluids. A 
charged object in an external electric held ^ feels a force in the direction 
of the electric held. A current in an external magnetic held ^ feels a force. 
The direction of this force is perpendicular to both the direction of the 
current and the direction of the external magnetic held. These effects are 
summarized by the Lorentz force equation 

'f = q{^ + ^ (10.9) 

which was discussed in Chapter 5. In that chapter. Hall effect devices 
and magnetohydrodynamic devices were discussed, both of which can be 
understood by the Lorentz force equation with an external magnetic held 
but no electrical held. This type of EHD can be understood by the Lorentz 
force equation with an external electric held but no magnetic held. The 
liquid in these devices must be conductive. When a voltage is applied 
across this type of EHD, an electric held is induced which causes the liquid 
to flow, and this effect is said to be due to a streaming potential [159]. A 
related effect called eleetrophoresis occurs in liquids which contain charged 
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particles [161]. If an electric field is applied, these particles will move. This 
effect has been demonstrated with charged DNA molecules and charged 
protein molecules in solutions [161]. 

Other EHD devices operate by changing material polarization of an 
insulating liquid, and this effect is called dielectrophoresis. The concept 
of material polarization was discussed in Section 2.2.1. If we apply an 
electric field across a conductor, whether that conductor is a solid or a 
liquid, charges will how. If we apply an electric field across a dielectric, 
the material may polarize. In other words, there will be some net charge 
displacement even if all electrons remain bound to atoms. The external 
electric field causes both the atoms of the liquid to polarize and these 
polarized atoms to how. 

There are a number of other interrelated EHD effects. Electroosmosis 
can occur in fiuids with a surface charge. In some liquids, ions build up on 
the surface due to unpaired chemical bonds, due to ions adsorbed onto the 
surface, or for other reasons. If an electric field is applied across this layer 
of charges, the fiuid will how, and this effect is called electroosmosis [161] 
[159]. Also, an external electric field applied across a fiuid may heat up 
part of the fiuid and cause a temperature gradient. Eluid may how due to 
the temperature gradient, and this effect is called electrothermal flow [161]. 
Another effect, known as electrowetting, occurs in conductive liquids. At 
the interface of a solid conductor and a conductive liquid, charges build up 
[61]. Again, if an electric field is applied, the liquid will how. 
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Part II 

Theoretical Ideas 

11 Calculus of Variations 

11.1 Introduction 

The previous chapters surveyed various energy conversion devices. The pur¬ 
pose of Chapters 11 and 12 is to establish a general framework to describe 
any energy conversion process. By placing energy conversion processes in 
a larger framework, we may be able to see relationships between processes 
or identify additional energy conversion processes to study. Establishing 
this framework requires some abstraction and hence some mathematics. In 
the next section, we define the Principle of Least Action and the idea of 
calculus of variations. In the following sections, we apply these ideas to two 
example energy conversion systems: a mass spring system and a capacitor 
inductor system. 

An advantage of using calculus of variations over other techniques is that 
the analysis is based on energy, which is a scalar, instead of the potential, 
which may be a scalar or vector. Working with a scalar quantity like energy 
instead of a vector can make the mathematics quite a bit more manageable. 

11.2 Lagrangian and Hamiltonian 

Consider a process which converts energy from one form to another. We 
are interested in how some quantity evolves during the energy conversion 
process, and we call this quantity the generalized path, y(t). For simplicity, 
we consider only the case where this path has one independent variable t 
and one dependent variable y. In this chapter, t represents time, but it 
can also represent position or another independent variable. These ideas 
generalize directly to situations with multiple independent and dependent 
variables [163] [164], but the multiple variable problem requires more in¬ 
volved mathematics. The units of generalized path depend on the energy 
conversion process under consideration. In the mass spring example of Sec. 
11.5, it represents position of a mass. In the capacitor inductor example of 
Sec. 11.6, it represents the charge built up on the plates of the capacitor. 
Aside from the energy conversion process under consideration, assume that 
no other energy conversion processes occur, even though this situation is 
unlikely. The system goes from having all energy in the first form to having 
all energy in the second form following the path y{t). 
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Define the Lagrangian C as the difference between the first and second 
forms of energy under consideration. The Lagrangian is a function of t, y, 
and and it has the units of joules. 





(First form of energy) — (Second form of energy) (11.1) 


At any time, the total energy of the system is the sum. Define the Hamil¬ 
tonian H, also in joules, as the total energy. 


H r/, = (First form of energy) + (Second form of energy) (11.2) 

Some forms of energy cannot be described by a Lagrangian of the form 
C (t, y, and instead require a Lagrangian of the form 


dy (Py d^y 

' dt'dp'dt?'" 


(11.3) 


[163, p. 56]. Such forms of energy will not be considered here. Energy is 
conserved in any energy conversion process. Conservation of energy can be 
expressed as 


dH _ dH 
dt dt 


(11.4) 


Derivatives of the Lagrangian will be useful in the discussion below. 
Define the generalized potential as the partial derivative of the Lagrangian 
with respect to the path, |^. The units of the generalized potential de¬ 
pend on the units of the path. More specifically, the units of the generalized 
potential are joules divided by the units of the path. Note that general¬ 
ized potential and potential energy are different ideas. Potential energy 
has units of joules while the units of generalized potential vary. Some au¬ 
thors use the term potential as a synonym for voltage, but this definition 
of generalized potential is more broad. For more information on the dis¬ 
tinction between potential, generalized potential, and potential energy see 
Appendix C. 

Define the generalized momentum M as the partial derivative of the 
Lagrangian with respect to the time derivative of the path. 


M = 



( 11 . 6 ) 
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Many authors use the variable p for generalized momentum. However, M 
will be used here because the variable p is already too overloaded. De¬ 
fine the generalized eapaeity as the ratio of the generalized path to the 
generalized potential. 

. Generalized path 

Generalized capacity=—--—^-— 

Generalized potential 

Gapacity is also discussed in Appendix G. 


11.3 Principle of Least Action 

Define the aetion § as the magnitude of the integral of the Lagrangian 
along the path. 

Assuming the independent variable t represents time in seconds, the action 
will have the units joule seconds. For energy conversion processes, the path 
found in nature experimentally is the path that minimizes the action. This 
idea is known as the Prineiple of Least Aetion or sometimes as Hamilton’s 
prineiple [163, p. 11]. The idea of conservation of energy is contained in 
this principle. 

To find a minimum or maximum of a function, find where the derivative 
of the function is zero. Here, C and H are not quite functions. Instead, 
they are funetionals. A funetion takes a scalar quantity as an input and 
returns a scalar quantity. A funetional takes a function as an input and 
returns a scalar quantity. Both C and H take the function y{t) as input 
and return a scalar quantity in joules. The idea of taking a derivative and 
setting it to zero to find a minimum is still useful, but we have to take 
the derivative with respect to the function y{t). The process of finding the 
maximum or minimum of a functional described by an integral relationship 
is known as ealeulus of variations. 

It is often easier to work with differential relationships than integral 
relationships. We can express the Principle of Least Action as differential 
equation, and it is called the Euler-Lagrange equation. 


dC _ d dC 

ay *a(|) 


( 11 . 8 ) 


If the Lagrangian C is known, we can simplify the Euler-Lagrange equation 
to an equation involving only the unknown path. The resulting equation 
in terms of path y(t) is called the equation of motion. 
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The Lagrangian provides a ton of information about an energy conver¬ 
sion process. If we can describe the difference between two forms of energy 
by a Lagrangian £ (t, y, , we can set up the Euler-Lagrange equation. 
From the Euler-Lagrange equation, we may be able to find the equation of 
motion and solve it. The resulting path minimizes the action and describes 
how the energy conversion process evolves with time. We can find the 
generalized potential of the system as a function of time too. The Euler- 
Lagrange equation is a conservation law for the generalized potential. The 
symmetries of the equation of motion may lead to further conservation laws 
and invariants. These last two ideas, and the math behind them, are often 
known as Noether’s theorem. Noether’s theorem says that there is a very 
close relationship between symmetries of either the path or the equation of 
motion and conservation laws [165] [166]. These ideas are discussed further 
in Sec. 14.5. 

Notice the mix of partial and total derivative symbols in Eq. 11.8. 
Since y(t) depends on only one independent variable, there is no need 
to use partial derivatives in expressing The derivative ^ is written in 
shorthand notation as y, and y may be used in place of The Lagrangian 
C depends on three independent-like variables: t, y, and Thus, the 
partial derivative symbols are used to indicate which partial derivative of 
£ is being considered. 

The first term of the Euler-Lagrange equation, |^, is the generalized po¬ 
tential defined above. The units of the generalized potential are joules over 
units of path, ^ path ' term of the Euler-Lagrange equation 

has these units. For example, if y(t) is in the units of meters, the generalized 
potential is in or newtons. Each term of the Euler-Lagrange equation 
represents a force, and the Euler-Lagrange equation is a conservation re¬ 
lationship about forces. As another example, if the path y(t) represents 
charge in coulombs, then the generalized potential has the units ^ which 
is volts. The Euler-Lagrange equation in this case is a conservation rela¬ 
tionship about voltages. 
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11.4 Derivation of the Euler-Lagrange Equation 

In this section, we use the Principle of Least Action to derive a differential 
relationship for the path, and the result is the Euler-Lagrange equation. 
This derivation closely follows [163, p. 23-33], so see that reference for 
a more rigorous derivation. Assume that we know the Lagrangian which 
describes the difference between two forms of energy, and we know the 
action. We want to find a differential relationship for the path y{t) which 
minimizes the action. This path has the smallest integral over t of the 
difference between the two forms of energy. 

Suppose that the path y{t) minimizes the action and is the path found 
in nature. Consider a path y{t) which is very close to the path y{t). Path 
y{t) is equal to path y{t) plus a small difference. 


y = y + er] (11.9) 

In Eq. 11.9, e is a small parameter, and rj = 7]{t) is a function of t. We can 
evaluate the Lagrangian at this nearby path. 

The Lagrangian of the nearby path y{t) can be related to the Lagrangian 
of the path y{t). 


^ I)=^ <*’ {’%+f i)+ 

Equation 11.11 is written as an expansion in the small parameter e. The 
lowest order terms are shown, and O(e^) indicates that all additional terms 
are multiplied by or higher powers of this small parameter. 

We can also express the difference in the action for paths y and y as an 
expansion in the small parameter e. 


S(y) - S(|/) = £ 



+ 


at ay 


+ 0(e=) 


( 11 , 12 ) 


The term in brackets is called the first variation of the action, and it is 
denoted by the symbol S. 


SS(>i,y) 


"• dC 


at ay 


to 


(11,13) 
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Path y has the least action, and all nearby paths y have larger action. 
Therefore, the small difference §(y)—§(r/) is positive for all possible choices 
of rjit). The only way this can occur is if the first variation is zero. 


SS(r), y)=0 


(11,14) 



at oy 


(11,16) 


If the action is a minimum for path y, then Eq. 11.15 is true. However, if 
path y satisfies Eq. 11.15, the action may or may not be at a minimum. 

Use integration by parts on the second term to put Eq. 11.15 in a more 
familiar form. 


u = 


dy 


, d dC , 
du = —^:—dt 
dt dy 

V = T] 

dv = "^dt 
dt 


'to 


dr] dC 


V 


dy 


1 ti 


d fdC\ 

(11.16) 


to 


'to 


dt \dy 


Assume the endpoints of path y and y align 


fi(fo) = fi(U) = 0. 


(11.17) 


dr] dC , 
-^^dt = - 


'to 


dt dy 

Combine Eq. 11.18 with Eq. 11.15. 

rh 


"'1 d fdC\ , 


'to 


dt \dyj 


0 = 


0 = 


J to 

r 

'to 


dC d fdC\ , 


V 


dy 


d 

dt 


d_^ 

dy 


dt 


(11.18) 


(11.19) 


( 11 . 20 ) 


Eor Eq. 11.20 to be true for all functions y, the term in brackets must be 
zero, and the result is the Euler-Lagrange equation. 


dy dt \dy J 


0 


( 11 , 21 ) 
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We have completed the derivation. Using the Principle of Least Action, 
we have derived the Euler-Lagrange equation. If we know the Lagrangian 
for an energy conversion process, we can use the Euler-Lagrange equation 
to find the path describing how the system evolves as it goes from having 
energy in the first form to the energy in the second form. 

The Euler-Lagrange equation is a second order differential equation. 


The relationship can 

be written instead as a pair of first order differential 

equations. 

dm dC 

(11.22) 


dt dy 

and 

dC 

(11.23) 


M = —. 
dy 

The Hamiltonian can 

be expressed as a function of the generalized 

momen- 

turn, [167, ch. 3]. 

H{t,y,M) = \My-C\ 

(11.24) 

Using the Hamiltonian, the Euler-Lagrange equation can be written as [167] 


dm dH 

(11.25) 


dt dy 

and 

dy dH 

dt ~ dm' 

(11.26) 


This pair of first order differential equations is called Hamilton’s equa¬ 
tions, and they contain the same information as the second order Euler- 
Lagrange equation. They can be used to solve the same types of problems 
as the Euler-Lagrange equation, for example finding the path from the 
Lagrangian. 


11.5 Mass Spring Example 

Examples in this section and the next section will illustrate how we can use 
the Euler-Lagrange equation to find the equation of motion describing an 
energy conversion process. Consider a system comprised of a mass and a 
spring where energy is transfered between spring potential energy stored in 
the compressed spring and kinetic energy of the mass. The mass is specified 
by m in kg. It is attached to a spring with spring constant K in The 
position of the mass is specified by x{t) where x is the dependent variable 
in meters and t is the independent variable time in seconds. Assume this 
mass and spring are either fixed on a level plane or in some other way 
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not influenced by gravity. This mass spring system is illustrated on the 
left side of Fig. 11.1. When the spring is compressed, the system gains 
spring potential energy. When the spring is released, energy is converted 
from spring potential energy to kinetic energy. Assume no other energy 
conversion processes, such as heating due to friction, occur. 


Uncompressed Spring Compressed Spring 



-10 1 -10 1 


Figure 11.1: A mass spring system. 


The right side of Fig. 11.1 shows the compressed spring held in place 
by a restraint. For t < 0, the system has no kinetic energy because the 
mass is not moving, and the system has potential energy in the compressed 
spring. At this time, the mass is at position x where a; < 0. The spring 
exerts a force on the mass. 


f 


spring 


—Kxttx 


(11.27) 


which is in the direction. 

At t = 0, the restraint is removed, and the spring potential energy is 
converted to kinetic energy. The first form is spring potential energy. 


F . — — 

-‘-^potential energy 

The second form is kinetic energy of the mass. 


^ , dx 

Ekinetic = -FI [ — 


(11.28) 


(11.29) 


At any instant of time, when the mass is at location x(t), the total energy 
is represented by the Hamiltonian. 


H 


E, 


total 


= E, 


potential energy 


E, 


kinetic 


(11.30) 


H 


1 

2 


Kx^ + -m 



(11.31) 
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The Lagrangian represents the difference between the forms of energy. 


C 


E: 


potential energy 




kinetic 


c 




(11.32) 

(11.33) 


Both the Hamiltonian and Lagrangian have units of joules. The generalized 
potential is 

dC 

^=Kx (11.34) 


in units of newtons, 
turn is 


Note that Kx = 
dC 


spring- The generalized momen¬ 
ta; 


9 ( 1 ) 


= —m 


dt 


(11.35) 


in units of —|— which is the units of momentum. 

At t = 0, the restraint is removed. The mass follows the path x{t). If 
we know the Lagrangian, we can find the path by trial and error. To find 
the path in this way, guess a path x{t) that the mass follows and calculate 
the action. 


§ 




dt 


(11.36) 


Repeatedly guess another path, and calculate the action. The path with 
the least action of all possible paths is the path that the mass follows. 
This path has the smallest difference between the potential energy and the 
kinetic energy integrated over time. 

We can think of many possible, but not physical, paths x(t) that the 
mass can follow. Figure 11.2 illustrates two nonphysical paths as well as the 
physical path derived below. Paths are considered over the time interval 
0 < t < 1. All three paths assume that initially, at t = 0, the spring is 
compressed so that the mass is at location a;(0) = —1. Also, they assume 
that at the end of the interval, at t = 1, the spring has expanded so that 
the mass is at location x{l) = 1. The possible paths illustrated in the figure 
are 

Xi{t) =2t — 1 (not physical) 

X 2 {t) = 2t^ — 1 (not physical) 


and 


xsit) = — cos(7rt) 


(physical). 
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The path xi(t) describes a case where the mass travels at a constant speed. 
The path X 2 {t) describes a case where the mass accelerates when the re¬ 
straint is removed, and the path Xs{t) describes a case where the mass first 
accelerates then slows. The action of each path can be calculated using Eq. 
11.36. For example purposes, the values of m = 1 kg and K = ^ are 

used. The path Xi{t) has S = 0.355, the path X 2 {t) has S = 0.364, and the 
physical path X 3 {t) has zero action S = 0. 

We can derive the path that minimizes the action and that is found in 
nature using the Euler-Lagrange equation. 


dC _ d dC 
dx ^ 


(11.37) 


The first term is the generalized potential. The second term is the time 
derivative of the generalized momentum. The equation of motion is found 
by putting these pieces together. 


Kx -f m 


d^x 

dt'^ 


0 


(11.38) 


The first term of the equation of motion is 


f 


spring 


The second term 


represents the acceleration of the mass. We have just found the equation of 
motion, and it is a statement of Newton’s second law, force is mass times 
acceleration. It is also a statement of conservation of force on the mass. 

Equation 11.38 is a second order linear differential equation with con¬ 
stant coefficients. It is the famous wave equation, and its solution is well 
known 


x(t) = Co cos 




(11.39) 


where cq and ci are constants determined by the initial conditions. If we 
securely attach the mass to the spring, as opposed to letting the mass get 
kicked away, it will oscillate as described by the path x{t). 

Energy is conserved in this system. To verify conservation of energy, 
we can show that the total energy does not vary with time. The total 
energy is given by the Hamiltonian of Eq. 11.31. In this example, both 
the Hamiltonian and the Lagrangian do not explicitly depend on time, 
^ = 0 and ^ = 0. Instead, they only depend on ehanges in time. For 
this reason, we say both the total energy and the Lagrangian have time 
translation symmetry, or we say they are time invariant. The spring and 
mass behave the same today, a week from today, and a year from today. 
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We can also verify conservation of energy algebraically by showing that 

f = 0- 


dH 

dt 


dH 


dH dx dH d^x 


dx dt 


a (%) de- 

dx d?x 


dH dx 

—— = 0 + Kx— + m , , „ 

dt dt dt dt^ 


dH dx 


dt dt 


d‘^x 


^ \ Kx + m— = 0 


dt'^ 


(11.40) 

(11.41) 

(11.42) 


Notice that the quantity in parentheses in the line above must be zero from 
the equation of motion. 

The Euler-Lagrange equation can be split into a pair of first order dif¬ 
ferential equations called Hamilton’s equations. 


dM _ dH 
dt dx 


and 


dx dH 

~di~lm 


(11.43) 


This example is summarized in Table 11.1. In analogy to language used 
to describe circuits and electromagnetics, the relationship between the gen¬ 
eralized path and the generalized potential is referred to as the constitutive 
relationship. Eollowing Eq. 11.6, the ratio of the generalized path to gen¬ 
eralized potential is the generalized capacity, and in this example, it is the 
inverse of the spring constant. While displacement x is assumed to be 
scalar, the vector it is used in the table for generality. 
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Energy 

storage 

device 

Linear 

Spring 

Generalized 

Path 

Displacement 
"af in m 

Generalized 

Potential 

'P Eorce in 

J- = N 
m 

Generalized 

Gapacity 

1 • m^ 
u in ^ 

Gonstitutive 

relation¬ 

ship 


Energy 

pi'TP = 

iT |;^|2 

2 rr 1 1 

Law for 
potential 

Newton’s 
Second Law 

P = mP 


Table 11.1: Summary of the mass spring system in the language of calculus 
of variations. 
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11.6 Capacitor Inductor Example 

The ideas of calculus of variations apply to energy conversion processes in 
electrical systems too. Consider a circuit with a capacitor and an inductor 
as shown in Figure 11.3. The current ii, the current ic, and the voltage 
V are defined in the figure. Assume that wires and components have no 
resistance. While this is not completely physical, it will allow us to simplify 
the problem. Assume that the capacitor is charged for t < 0, and the switch 
is open. At t = 0, the switch is closed, and the capacitor begins discharging. 
In this example, the generalized path will be the charge built up on the 
plates of the capacitor. We can derive the equation of motion that describes 
this path. 



< = 0 


Figure 11.3: A capacitor inductor system. 

Energy is converted between two forms. The first form of energy in this 
system is electrical energy stored in the capacitor. The voltage v in volts 
across a capacitor is proportional to the charge Q in coulombs across the 
plates of the capacitor. Capacitance C, measured in farads, is the constant 
of proportionality between the two measures. 

Q = Cv (11.44) 

The current-voltage relationship across the capacitor can be found by tak¬ 
ing the derivative with respect to time. 

dQ _ ^dv 
dt dt 

The change in charge build up with respect to time is 
specifically, 

dQ 

Equations 11.45 and 11.46 can be combined. 


(11.45) 
the current. More 


(11.46) 
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The energy stored in a capacitor is 

Ecap = (11.48) 

The second form of energy in this system is the energy stored in the 
magnetic field of the inductor. The current ii through the inductor, mea¬ 
sured in amperes, is proportional to the magnetic flux T, measured in 
webers, around the inductor. Inductance L, measured in henries, is the 
constant of proportionality between the current and magnetic flux. 

4/ = Lii (11.49) 


The current voltage relationship across this inductor can be found by taking 
the derivative with respect to time. 


dT ^diL 

—— = V = L—— 
at at 


(11.50) 


The energy stored in the inductor is given by 



(11,61) 


We describe the energy conversion process by keeping track of a the 
generalized path Q(t), the charge stored on the capacitor. The variable t 
represents the independent variable time in seconds, and Q is the dependent 
variable charge in coulombs. The Hamiltonian and Lagrangian, H and £, 
will be considered functions of three independent-like variables: t, Q, and 

dQ 

dt 

The Hamiltonian is the sum of the energy in the capacitor and the 
energy in the inductor. The Lagrangian is the difference between these 
energies. 


Electrical engineers typically describe physical circuits using the most easily 
measured quantities: current and voltage. However, here to illustrate the 
use of the calculus of variations formalism, we write expressions for both 
the total energy and the Lagrangian in terms of the specified variables: t, 
Q, and f. 




(11.54) 
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C 




(11.55) 


We can find the path, charge on the capacitor as a function of time, by 
solving for the least action 


5 



C 


dx\ 



or by solving the Euler-Lagrange equation. 


dC d dC 

^~dtd{W) 


( 11 . 66 ) 


(11,67) 


In Eq. 11.56, 6 indicates the first variation as defined by Eq. 11.13. Solu¬ 
tions depend on initial conditions such as the charge stored in the capacitor 
and the current in the inductor at the initial time. We can use the Euler- 
Lagrange equation to find the equation of motion. The first term of Eq. 
11.57 is the generalized potential. 


dC _Q 
^~C 


(11.58) 


which is the voltage v in volts. The next term is the derivative of the 
generalized momentum. 


M = 



^dQ 

dt 


(11.59) 


We can put the pieces together to find an expression of conservation of the 
generalized potential. 


Q 

C 


d^Q 

dt"^ 


= 0 


(11.60) 


This is a statement of Kirchhoff’s voltage law. It looks more familiar if it 
is written in terms of voltage v = ^ and current ii = 


= 0 (11.61) 

We can solve the equation of motion, Eq. 11.60, using appropriate initial 
conditions, to find the path. As in the mass spring example, Eq. 11.60 is 
the wave equation, and its solutions are sinusoids. As expected, a circuit 
made of only a capacitor and inductor is an oscillator. 
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Energy 

storage 

device 

Gapacitor 

Linear 

Spring 

Generalized 

Path 

Gharge Q 
in G 

Displacement 
"af in m 

Generalized 

Potential 

Voltage V in 

n = v 

'P Eorce in 

J- = N 
m 

Generalized 

Gapacity 

Gapacitance 
Gin , 

1 • m^ 

u 111 X 

Gonstitutive 

relation¬ 

ship 

Q = Cv 

it - -P 

•l' 

Energy 

ICv^ 


Law for 
potential 

KVL 

Newton’s 
^cond Law 

P = mP 


Table 11.2: Summary of the capacitor inductor system in the language of 
calculus of variations. 


Furthermore, we can show that energy is conserved in this energy con¬ 
version process because the partial derivative of both the total energy and 
the Lagrangian with respect to time are zero. 


dC _ dH 
~ 

dC _ dH 
dt dt 


(11.62) 

(11.63) 


Table 11.2 summarizes this example. It also illustrates the relationship 
between parameters of this example and parameters of the mass spring 
example. 
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11.7 Schrodinger’s Equation 

Quantum mechanics is the study of microscopic systems such as electrons 
or atoms. Calculus of variations and the idea of a Hamiltonian are fun¬ 
damental ideas of quantum mechanics [136]. In Chapter 13, we apply the 
ideas of calculus of variations to an individual atom in a semiclassical way. 

We can never say with certainty where an electron or other microscopic 
particle is located or its energy. However, we can discuss the probability of 
finding it with a specific energy. The probability of finding an electron, for 
example, in a particular energy state is specified by where tjj is called 
the wave function [136]. As with any probability 0<|r^P<l. 

For example, suppose that as an electron moves, kinetic energy is con¬ 
verted to potential energy. The quantum mechanical Hamiltonian Hqm is 
then the sum of the kinetic energy Ekimtic and potential energy Epotentiai energy 


Hqm — E, 


kinetic 


E. 


potential energy 


(11.64) 


Kinetic energy is expressed as 

Ekinetic = (Mqm)'^ (11.65) 

where m is the mass of an electron. In the expression above, Mqm is the 
quantum mechanical momentum operator, and 

{MqmY = AfgM ■ Mqm- (11.66) 


The quantum mechanical momentum operator is defined by 

Mqm = 


(11.67) 


where the quantity h is the Planck constant divided by 27r. The del operator, 
^, was introduced in Sec. 1.6.1, and it represents the spatial derivative of 
a function. The quantities Hqm, Mqm, and ^ are all operators, not just 
values. An operator, such as the derivative operator A, acts on a function. 
It itself is not a function or value. 

Using the of momentum definition of Eq. 11.67 and the vector identity 
of Eq. 1.10, we can rewrite the Hamiltonian. 


-h^ 

Hqm = ——-|- U, 


2m 


potential energy 


( 11 . 68 ) 


In quantum mechanics, the Hamiltonian is related to the total energy. 


Hqm^J = Etotalfj 


( 11 . 69 ) 
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The above two equations can be combined algebraically. 
-h\ 


2m 


V T EpQ^gntial energy ij) = Etotallp 


(11.70) 


With some more algebra, Eq. 11.70 can be rewritten. 


2m 


V t/’ T ^2 {Efotal Epofg^Hg^i energy') t/’ 0 


(11.71) 


Equation 11.71 is the time independent Sehrodinger equation, and it is one 
of the most fundamental equations in quantum mechanics. Energy level 
diagrams were introduced in Section 6.3. Allowed energies illustrated by 
energy level diagrams satisfy the Sehrodinger equation. At least for simple 
atoms and ground state energies, energy level diagrams can be derived by 
solving Sehrodinger equation. 


11.8 Problems 


11.1. In the examples below, identify whether / is a function or a func¬ 
tional. 

• A parabola is described by f{x) = x‘^. 

• Given two forms of energy and a path y{t), f is the Lagrangian 
of the system C (t, y, . 

• Given the magnitude of the velocity |T^(t)| of an object, / rep¬ 
resents the distance that the object travels from time 0 to time 
3600 seconds. 

• Given the position {x,y,z) in space, f{x,y,z) represents the 
distance from that point to the origin. 

11.2. A system has the Lagrangian C(t,y,^) = Eind an 

equation for the path y{t) that minimizes the action C (t, y, dt. 
(The result is nonlinear, so don’t try to solve it.) 

11.3. A system has Lagrangian C (t,y, | + I ' y~‘^- Fiiid the 

corresponding equation of motion. (The result is nonlinear, so don’t 
try to solve it.) 
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11.4. Figure 11.2 illustrates three possible paths for the mass spring system 
and their corresponding actions. The paths considered are: 

xi{t) = 2t — 1 


X2{t) = 2t^ - 1 
xsit) = — cos(7rt) 

For each path, calculate the action using Eq. 11.36 to verify the 
values shown in the figure. Assume a mass of m =1 kg and a spring 
constant of iF = tt^ 

11.5. The figure shows a torsion spring. It can store potential energy 

and it can convert potential energy to kinetic energy ■ In 

these expressions, 0(t) is the magnitude of the angle the spring turns 
in radians, and a; = ^ is the magnitude of the angular velocity in 
radians per second. IK is the torsion spring constant, and I is the 
(constant) moment of inertia. 

(a) Find the Lagrangian. 

(b) Use the Euler-Lagrange equation to find a differential equation 
describing 0(t). 

(c) Show that energy is conserved in this system by showing that 

^ = 0 
dt 

(d) Set up Hamilton’s equations. 



11.6. The purpose of this problem is to derive the shortest path y{x) be¬ 
tween the points (xo,yo) and (xi,yi). Consider an arbitrary path 
between these points as shown in the figure. We can break the path 
into differential elements d I = dxdx + dydy. The magnitude of each 
differential element is 



2 


\d I \ = \/{dxY + {dyY = dxJ 1 + 
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The distance between the points can be described by the action 



To find the path y{x) that minimizes the action, we can solve the 

Euler-Lagrange equation, with C = \JI + as the Lagrangian, 

for this shortest path y{x). This approach can be used because we 
want to minimize the integral of some functional C even though this 
functional does not represent an energy difference [163, p. 33]. 

Set up the Euler-Lagrange equation, and solve it for the shortest 
path, y{x). 

Hint 1: The answer to this problem is that the shortest path be¬ 
tween two points is a straight line. Here, you will derive this result. 
Hint 2: In the examples of this chapter, the Lagrangian had the form 
C [t,y, with independent variable t and path y{t). Here, the La¬ 
grangian has the form C {x,y, where the independent variable is 
position X, and the path is y{x). 

Hint 3: If ^ {something) = 0, then you know that {something) is 
constant. 


{xi,yi) 



11.7. Light travels along the quickest path between two points. This idea 
is known as Fermat’s principle. In a material with relative permit¬ 
tivity Cr and permeability /io, light travels at the constant speed 
where c is the speed of light in free space. In Prob. 11.6, we showed 
that the shortest path between two points is a straight line, so in a 
uniform material, fight will travel along a straight line between two 
points. However, what if light travels across a junction between two 
materials? In this problem, we will answer this question and derive 
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a famous result known as Snell’s law. 

Consider the figure below. Assume that a ray of light travels from 
(xo, yo) to (xi, yi) along the path which takes the shortest time. Mate¬ 
rial 1 has relative permittivity e^i, so the light travels in that material 
at a constant speed Material 2 has relative permittivity 6^2, so 
the fight travels in that material at a constant speed As we 
derived in the Prob. 11.6, the light travels along a straight line in 
material 1, and it travels along a straight line in material 2. However, 
the fines have different slopes as shown in the figure. Assume that 
the junction of the two materials occurs at a; = 0. 

(a) Find an equation for the total time it takes the fight to travel as 
a function of h, the vertical distance at which the path crosses 
the y axis. Note that you are finding a function here, F{h), 
not a functional. You can use the fact that you know the light 
follows a straight fine inside each material to find this function. 

(b) The path followed by the light takes the minimum time, so the 
derivative ^ = 0. Use this idea to find an equation for the 
unknown vertical height h. Your answer can be written as a 
function of the known constants e^i, 6^2, xq, r/o, xi, r/i,and c. 
You do not need to solve for h here, but instead just evaluate 
the derivative and set it to zero. 

(c) Use your result in part b above to derive Snell’s law: 

sin 6i = sin 62 
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11.8. A pendulum converts kinetic energy to and from gravitational poten¬ 
tial energy. As shown in the figure, a ball of mass m is hung by a 
string 1 m long. The pendulum is mounted on a base that is 3 m 
high. As shown in the figure, 6{t) is the angle of the pendulum. The 
kinetic energy of the ball is given by E^inetic = (^) , and the 

gravitational potential energy is given by Ep ^. = mg (3 — cos 6*). The 
quantity g is the gravitational constant, g = 9.^ 


m 

S2' 



3 — cos(6*) 


(a) Find £, the Lagrangian of the system. 

(b) Find 77, the Hamiltonian of the system. 

(c) Set up the Euler-Lagrange equation, and use it to find the equa¬ 
tion of motion for 6{t), the angle of the pendulum as a function 
of time. 

(d) Show that energy is conserved in this system by showing that 

^ = 0 
dt 

The equation of motion found in part c is nonlinear, so don’t try to 
solve it. Interestingly, it does have a closed form solution [164, Ch. 
6 ]. (This problem is a modified version of an example in reference 
|163|.) 

11.9. As shown in the figure, an object of charge Qi and mass m moves 
near a stationary object with charge Q 2 . Assume the mass and the 
charges are constants, and assume the objects are surrounded by free 
space. The kinetic energy of the moving object is converted to or 
from energy stored in the electric field between the objects. The 
kinetic energy of the moving object is given by (^) . The energy 
of the electric field is given by where Cq is the permittivity of 
free space. The distance between the objects is given by x{t). 

(a) Find the Lagrangian of the system. 

(b) Find the generalized momentum. 
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(c) Find the generalized potential. 

(d) Find the equation of motion for the path x{t) of the system. 
(Don’t try to solve this nonlinear equation.) 

(e) Find the total energy of the system. 

(f) Show that energy is conserved in this system. 

^^ 


Qi 


moving 


Q2 


stationary 
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12 Relating Energy Conversion Processes 

12.1 Introduction 

In the previous chapter, the concept of calculus of variations was intro¬ 
duced. The purpose of this chapter is to draw relationships between a 
wide range of energy conversion processes. Processes in electrical engineer¬ 
ing, mechanics, thermodynamics, and chemistry are described using the 
language of calculus of variations. Similarities between the processes are 
highlighted and summarized into tables. 

This chapter illustrates how to apply calculus of variations ideas to dis¬ 
parate branches of science and engineering. Electrical engineers typically 
use current and voltage to describe circuits. Ghemists use temperature, 
pressure, entropy, and volume when describing chemical reactions. Engi¬ 
neers and scientists in each discipline have their own favorite quantities. 
However, energy conversion is a common topic of study. Galculus of varia¬ 
tions provides a unifying language. Scientists and engineers typically spe¬ 
cialize, becoming experts in a particular area. However, open questions are 
more often found at the boundary between disciplines, where there is less 
expertise. Gomparing ideas between different disciplines is useful because 
ideas from one discipline may answer questions in another, and challenges 
in one discipline may pose interesting research questions in another. 

By studying the mass spring system of Sec. 11.5, the resulting equation 
of motion was Newton’s second law. By studying the capacitor inductor 
system of Sec. 11.6, the resulting equation of motion was Kirchoff’s volt¬ 
age law. In this chapter we identify the equation of motion for multiple 
other systems. Through this procedure, we encounter some of the most 
fundamental laws of physics including including Gauss’s laws, conservation 
of momentum, conservation of angular momentum, and the second law of 
thermodynamics. 

The discussion in this chapter is necessarily limited. Entire texts have 
been written about each energy conversion processes discussed. Addition¬ 
ally, the idea of applying calculus of variations to these energy conversion 
processes is not novel. Other authors have compared electrical, mechanical, 
and other types of energy conversion processes too [168] [169]. 

Some rather drastic assumptions are made in this chapter. We assume 
energy is converted between one form and another with no other energy 
conversion process occurring. Eor example in a mass spring system, energy 
is converted between kinetic energy and spring potential energy while ignor¬ 
ing heating due to friction, energy conversion due to gravitational potential 
energy, and so on that might occur in a real system. 
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12.2 Electrical Energy Conversion 

Electrical can be described either in circuits language or electromagnetics 
language. Using circuits language, electrical systems are described by four 
fundamental parameters: charge in coulombs Q, voltage in volts v, mag¬ 
netic flux in webers th, and current in amperes i . Eor circuits described in 
this language, resistors, capacitors, and other electrical energy storage and 
conversion devices are treated as point-like with no length or extent, and 
forces and flelds outside the path of the circuit are ignored. An alternative 
approach is to use electromagnetics language the electrical properties of 
materials are studied as a function of position and forces and flelds outside 
of the path of a circuit are studied. 

We can use circuits language to describe a number of energy conversion 
devices. Resistors convert electrical energy to thermal energy, and ther¬ 
moelectric devices convert thermal energy to or from electrical energy. A 
charging capacitor converts electrical energy to energy stored in a material 
polarization, and a discharging capacitor converts the energy of the mate¬ 
rial polarization back to electrical energy. In an inductor, electrical energy 
is converted to and from energy of a magnetic held. 

In Sec. 11.6, energy storage in a capacitor was studied in detail and 
described in the language of calculus of variations. Table 11.2 summarized 
the use of calculus of variations language to describe the energy conversion 
process, and it is repeated in the second column of the Table 12.1. In that 
example, charge built up on the capacitor plates, Q, was the generalized 
path. The generalized potential was v, the voltage across the capacitor. 
Erom these choices, other parameters were selected. 

Instead of choosing charge Q as the generalized path, we could have 
chosen the generalized path to be one of the other fundamental variables of 
circuit analysis, voltage v, magnetic flux T, or current i. Table 12.1 summa¬ 
rizes parameters that result when we describe energy conversion processes 
occurring in a capacitor or inductor in the language of calculus of varia¬ 
tions with these choices of generalized path. More speciflcally, the third 
column shows parameters when voltage is chosen as the generalized path. 
The fourth column shows parameters when magnetic flux is chosen as the 
generalized path, and the fifth column shows parameters when current is 
chosen as the generalized path. By reading down a column of the table, we 
see how to describe a process with this choice of generalized path. By read¬ 
ing across the rows of the table, we can draw analogies between parameters 
of energy conversion processes. 

To describe the energy conversion processes occurring in a capacitor, we 
can choose either the charge or voltage to be the generalized path then use 
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the language of calculus of variations. Notice that if charge is chosen as the 
generalized path as seen in column two of Table 12.1, voltage becomes the 
generalized potential. However, when voltage is chosen as the generalized 
path as seen in column three, charge becomes the generalized potential. 
The path found in nature minimizes the action, and we saw in Sec. 11.6 
that we could use the Euler-Lagrange equation to set up an equation of 
motion for the system. Each term of the equation of motion has the same 
units as the generalized potential. The equation of motion found when 
using Q as generalized path is Kirchoff’s Voltage Law (KVL), which says 
the sum of all voltage drops around a closed loop in a circuit is zero. The 
equation of motion found when using v as the generalized path is the law 
of conservation of charge. Both of these concepts are fundamental ideas in 
circuit theory, and they are shown in the second to last row of the table. 

Similarly, to describe the energy conversion processes occurring in an 
inductor, we may choose either magnetic flux or current as the generalized 
path. If we choose magnetic flux as the generalized path, the generalized 
potential is current. If we choose current as the generalized path, the 
generalized potential is magnetic flux. Erom the first choice, the equation 
of motion found is Kirchoff’s Gurrent Law (KGL). Erom the second choice, 
the equation of motion found is conservation of magnetic flux. 

The relationship between the generalized path and the generalized po¬ 
tential is known as the constitutive relationship [168, p. 30]. Eor a capaci¬ 
tor, it is given as 

Q = Cv. (12.1) 

The constant C that shows up in this equation is the capacitance in farads. 
Analogously for an inductor, the constitutive relationship is 

^ = Li (12.2) 

where L is the inductance in henries. We will see that we can identify 
constitutive relationships for other energy conversion processes, and we 
similarly can come up with a parameter describing the ability to store 
energy in the device. In analogy to the capacitor, we will call this parameter 
the generalized eapaeity. Gapacitance C represents the ability to store 
energy in the device, so generalized capacity represents the ability to store 
energy in other devices. Overloading of the term capacity is discussed in 
Appendix G. 
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Energy 

storage 

device 

Gapacitor 

Gapacitor 

Inductor 

Inductor 

Generalized 

Path 

Gharge Q 
in G 

Voltage V in 
V 

Mag. Elux 

T in Wb 

Gurrent i in 
A 

Generalized 

Potential 

Voltage V in 
i = V 

Gharge Q 
in G 

Gurrent i in 

A - Jr. 

WE 

Mag. Elux 

T in Wb 

Generalized 

Gapacity 

Gapacitance 
Gin ^ 

1 

c 

Inductance 

L in 

H _ Wb" 

1 

L 

Gonstitutive 

relation¬ 

ship 

Q = Cv 

Q 

^ = Li 

^ L 

Energy 


1 

2 C 


1 

2 L 

Law for 
potential 

KVL 

Gonservation 
of Gharge 

KGL 

Gonservation 
of Mag. 

Elux 

This 

column 

assumes 

AG current 
and voltage 

AG current 
and voltage 

AG current 
and voltage 

AG current 
and voltage 


Table 12.1: Describing electrical circuits in the language of calculus of 
variations. 
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Gircuit Quantity 

Electromagnetic Eield 

Q 

Gharge in G 


- V — 

Displacement flux density in ^ 

V 

Voltage in V 

t 

Electric field intensity in ^ 

T 

Magnetic fiux in Wb 


magnetic fiux density in 

i 

Gurrent in A 


■A 

Magnetic field intensity in ^ 


Table 12.2: Quantities used to describe circuits and electromagnetic fields. 


Using electromagnetics language, four vector fields describe systems: 
^ displacement fiux density in ^ electric field intensity in ^ 
magnetic fiux density in , and magnetic field intensity in These 
electromagnetic fields are generalizations of the circuit parameters charge 
Q, voltage v, magnetic fiux T, and current i respectively as shown in Table 
12.2. However, the electromagnetic fields are functions of position x, y, and 
2 : in addition to time, and they are vector instead of scalar quantities. More 
specifically, displacement fiux density is the charge built up on a surface per 
unit area, and magnetic fiux density is the magnetic fiux through a surface. 
Similarly, electric field intensity is the negative gradient of the voltage, and 
magnetic field intensity is the gradient of the current. We encountered 
these electromagnetic fields when discussing antennas in Ghapter 4. 

A capacitor can store energy in the charge built up between the capac¬ 
itor plates. Analogously, an insulating material with permittivity greater 
than the permittivity of free space, e > eo, can store energy in the dis¬ 
tributed charge separation throughout the material. We can describe the 
energy conversion processes occurring in a capacitor using the language 
of calculus of variations by choosing either charge Q or voltage v as the 
generalized path. Parameters resulting from these choices are shown in the 
second and third column of Table 12.1. Analogously, we can describe the 
energy conversion processes occurring in an insulating material with e > eo 
using the language of calculus of variations by choosing either ^ or ^ as 
the generalized path. Parameters resulting from these choices are shown in 
the second and third column of Table 12.3. The equation of motion that 
results in either case is Gauss’s law for the electric field. 


^3 = 


Pch 


(12.3) 


where pch is charge density. The derivation is beyond the scope of this text, 
however, because it involves applying calculus of variations to quantities 
with multiple independent and dependent variables. Gauss’s law is one of 
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Maxwell’s equations, and it was introduced in Section 1.6.1. In Chapter 2, 
piezoelectric energy conversion devices were discussed, and in Chapter 3, 
pyroelectric and electro-optic energy conversion devices were discussed. All 
of these devices involved converting electrical energy to and from energy 
stored in a material polarization of an insulating material with e > eo. 
Calculus of variations can be used to describe energy conversion in all of 
these devices with either displacement flux density or electric fleld intensity 
as the generalized path. Eor a device made from a material of permittivity 
e with an external electric fleld intensity across it given by the energy 
density stored is in The energy stored in a volume V is found 

by integrating this energy density with respect to volume, and this energy 
stored in a volume is listed in the second to last row of Table 12.3. Notice 
the similarity of the equation for the energy stored in a capacitor (second 
column, second to last box of Table 12.1) and this equation for the energy 
density stored in a material with e > eo (second column second to last box 
of the Table 12.3). 

Energy can also be stored in materials with permeability greater than 
the permeability of free space, /i > (Iq. Hall effect devices and magneto¬ 
hydrodynamic devices were discussed in Chapter 5. These devices are all 
inductor-like, and the parameters used to describe inductive energy con¬ 
version processes in the language of calculus of variations are summarized 
in the last two columns of the Table 12.3. Calculus of variations can be 
used to describe energy conversion processes in these devices with either 
magnetic flux density or magnetic fleld intensity as the generalized path 
and the other choice as the generalized potential. The equation of motion 
resulting from using calculus of variations to describe inductive systems 
corresponds to Gauss’s law for the magnetic fleld, 

^■^ = 0. (12.4) 

The physics of antennas is described by electric and magnetic fields, and 
any of the columns of Table 12.3 can be used to describe energy conver¬ 
sion between electricity and electromagnetic waves in antennas using the 
language of calculus of variations. 



12 RELATING ENERGY GONVERSION PROGESSES 


275 


Energy 

storage 

device 

Dielectric 
Material, 
e > eo 

Dielectric 
Material, 
e > eo 

Magnetic 

Material, 

fl> fiO 

Magnetic 

Material, 

fi> fiO 

Generalized 

Path 

Displacement 

Elux 

Density D 

in -C 
m j ^2 

Electric 

Eield 

Intensity E 
in 

V J 

m Gm 

Magnetic 

Elux 

Density B 

in Wb 
m j ^2 

Magnetic 

Eield 

Intensity H 
in 

A _ J 
ni Wb m 

Generalized 

Potential 

Electric 

field 

Intensity E 
in 

V J 

m Gm 

Displacement 

Elux 

Density U 

in -C 
m j ^2 

Magnetic 

Eield 

Intensity H 
in 

A _ J 
ni Wb m 

Magnetic 

Elux 

Density B 

in Wb 
m j ^2 

Generalized 

Gapacity 

Permittivity 
e in 

E G" 
m .J.m 

1 

€ 

Permeability 
fi in 

H Wb" 
m .l.m 

1 

Gonstitutive 

relation¬ 

ship 


~S = 1~S 

e 


~S='-~S 

Energy 


/vi;|SPrfV 


lAPW 

Law for 
potential 

Gauss’s 

Law for 

i'G 

y ■ D = pch 

Gauss’s 

Law for 

1"% = 

^Pch 

Gauss’s 

Law for 

Mag. 

^ = 0 

Gauss’s 

Law for 

Mag. 

^ = 0 


Table 12.3: Describing electromagnetic systems in the language of calculus 
of variations. 
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12.3 Mechanical Energy Conversion 

The previous section summarized how the language of calculus of variations 
can be applied to electrical and electromagnetic energy conversion devices. 
Similarly, this language can be used to describe energy conversion processes 
occurring in linear springs, torsion springs, moving masses, and flywheels. 

We can convert energy to and from spring potential energy by com¬ 
pressing and releasing a spring. Similarly, we can store or release energy 
from a moving mass by changing its velocity. A flywheel is a device that 
stores energy in a spinning mass. Flywheels are used, in addition to batter¬ 
ies, in some electric and hybrid vehicles because storing rotational kinetic 
energy in a flywheel requires fewer energy conversion processes than storing 
energy in a battery. All of these energy conversion devices can be described 
in the language of calculus of variations with some parameter chosen as the 
generalized path. 

Tables 12.4 and 12.5 summarize the parameters resulting from describ¬ 
ing mechanical energy conversion processes in the language of calculus of 
variations. While electromagnetic systems are described by four vector 
flelds, mechanical systems are described by eight possible vector flelds, and 
they are listed along with their units in Table 12.6. Each column of Tables 

12.4 and 12.5 describes the case of choosing a different vector field from 
Table 12.6 as the generalized path. By comparing across the rows of these 
tables as well as the electrical tables, comparisons can be made between 
the different energy conversion processes. 

In Sec. 11.5, energy conversion in a linear spring was discussed in the 
language of calculus of variations. That example considered the displace¬ 
ment of a point mass m in kg where the generalized path was chosen to 
be displacement x in m. The resulting Euler-Lagrange equation was New¬ 
ton’s second law. Section 11.5 concluded with Table 11.1 summarizing the 
resulting parameters. The third column of the Table 12.4 repeats that 
information. 

Circuit devices are often assumed to be point-like while electromagnetic 
properties of materials, like permittivity and permeability, are specified as 
functions of position. Similarly, mechanical devices can be treated as point¬ 
like or as functions of position. For example, mass is used to describe a 
point-like device while density is used to describe a device that varies with 
position. Researchers studying aerodynamics and fluid dynamics typically 
prefer the latter description. However, in Tables 12.4 and 12.5, point-like 
devices of mass m are assumed. Ideas in these tables can be generalized 
to situations where energy conversion devices are not treated as point-like 
and instead mass and other material properties vary with position. 
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Energy 

storage 

device 

Linear 

Spring 

Linear 

Spring 

Elywheel 

Elywheel 

Generalized 

Path 

'P Eorce in 

— = N 
m 

Displacement 
in m 

^ang 

Angular 

Velocity in 
rad 
s 

Lam 

Angular 
Momentum 
in J-s 

Generalized 

Potential 

Displacement 
"af in m 

P Eorce in 

— = N 
m 

Lam 

Angular 
Momentum 
in J-s 

^ang 

Angular 

Velocity in 
rad 
s 

Generalized 

Gapacity 

K in 

1 • m2 

77 111 X 

¥ in 1 — 

I kgm2 

I in kg ■ m^ 

Gonstitutive 

relation¬ 

ship 

P = Kit 

it — —P 

^anq 

j ■‘-'am 

Lam 

in —^ 

‘^^ang 

Energy 


= 

2 Ki-^ I 

l7[ L , 1“ — 

2^ l^angl 

-2 

Ilf 

2 1 ^am 


Law for 
potential 

Newton’s 
^cond Law 

P = mP 

Newton’s 
^cond Law 

P = mP 

Gonservation 
of Angular 
Momentum 

Gonservation 
of Angular 
Momentum 


Table 12.4: Describing mechanical systems in the language of calculus of 
variations. 
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Energy 

storage 

device 

Moving 

Mass 

Moving 

Mass 

Torsion 

Spring 

Torsion 

Spring 

Generalized 

Path 

Momentum 

in 

kg-m _ J.s 
s m 

it Velocity 

m 
m ^ 

torque in 
N-m _ J 
rad rad 

Angular 
Displace¬ 
ment 9 in 
radians 

Generalized 

Potential 

it Velocity 

m 
m ^ 

Momentum 

in 

kg-m _ J.s 
s m 

Angular 
Displace¬ 
ment 9 in 
radians 

torque in 
N-m _ J 
rad rad 

Generalized 

Gapacity 

m in kg 

— in -p- 

IK in 

rad 

1 • rad^ 

K 111 J 

Gonstitutive 

relation¬ 

ship 

M = mlt 

it = 

m 



Energy 

lrn\ltt = 

1 |aI|2 

2 m 

= 

1 iTtl^ 

2 m 

iK|?|2 = 

1K|?|2 = 

Law for 
potential 

Gonservation 

of 

Momentum 

Gonservation 

of 

Momentum 

Gonservation 
of Torque 

Gonservation 
of Torque 


Table 12.5: Describing more mechanical systems in the language of calcu¬ 
lus of variations. 
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Symbol 

Quantity 

Units 


Eorce 

N 

ill 

Momentum 

kg-m 

S 

V 

Velocity 

m 

s 


Positional displacement 

m 

Lam 

Angular momentum 

J-s 

y 

Angular displacement vector 

rad 


Torque 

N-m 

^ang 

Angular velocity 

rad 

s 


Table 12.6: Vector fields for describing mechanical displacement and fiuid 
fiow. 

The vector fields fisted in Table 12.6 are related by constitutive rela¬ 
tionships: 


M = mP^ 

(12.5) 

f = Kit 

(12.6) 

1^ = Kt 

(12.7) 

Lam '^^ang 

(12.8) 


Equation 12.6 is more familiarly known as Hooke’s law. By analogy to 

the capacitance of Eq. 12.1, the coefficients in these equations are referred 

to in Tables 12.4 and 12.5 as generalized capacity, and they represent the 

ability to store energy in the device. The constant m in Eq. 12.5 is mass 

in kg. The constant K in Eq. 12.6 is spring constant in The constant 

IK in Eq. 12.7 is torsion spring constant in ——j. The constant I in Eq. 

r8,Q.i8,ns 

12.8 is moment of inertia in units kg-m^. A point mass rotating around the 
origin has a moment of inertia I = m|T^P where IT^I is the distance from 
the mass to the origin. A solid shape has moment of inertia 


I 


(R = 




(12.9) 
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Interestingly, there is a close relationship between the quantities in Ta¬ 
bles 12.3 and 12.5. Maxwell’s equations, first introduced in Section 1.6.1, 
relate the four electromagnetic field parameters. Assuming no sources, 
"^ = 0 and pch = 0, Maxwell’s equations can be written: 



(12.10) 


(12.11) 


(12.12) 


(12.13) 

The last two relationships. Gauss’s laws, result directly from using calculus 
of variations to set up the Euler-Lagrange equation and solving for the 
corresponding equation of motion. We can replace electromagnetic vector 
fields in the source-free version of Maxwell’s equations by mechanical fields 

according to the transformation: 


^ ^ ill 

(12.14) 

^ ^ 

(12.15) 

^ 

(12.16) 


(12.17) 

The transformation of Eqs. 12.14 -12.17 leads to set of equations accurately 

describing relationships between these mechanical fields. 


at 

(12.18) 

at 

(12.19) 

O 

tl> 

(12.20) 




( 12 . 21 ) 
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The last rows of Tables 12.4 and 12.5 list the relationship that results 
when an energy conversion device is described in the language of calcu¬ 
lus of variations, the Euler-Lagrange equation is set up, and the Euler- 
Lagrange equation is solved for the equation of motion. The laws that 
result, Newton’s second law, conservation of momentum, conservation of 
angular momentum, and conservation of torque, are fundamental ideas of 
mechanics. 
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12.4 Thermodynamic Energy Conversion 

Four fundamental thermodynamic properties were introduced in Section 
8.2: volume V, pressure P, temperature T, and entropy S. Many devices 
convert between some form of energy and either energy stored in a confined 
volume, energy stored in a material under pressure, energy in a tempera¬ 
ture difference, or energy of a disordered system. We can describe energy 
conversion processes in these devices using the language of calculus of vari¬ 
ations with one of these parameters, V, P, T, or S, as the generalized 
path and another as the generalized potential. Table 12.7 summarizes the 
results. 

Many sensors convert energy between electrical energy and energy stored 
in a volume, pressure, or temperature difference. A capacitive gauge can 
measure the volume of liquid fuel versus vapor in the tank of an aircraft. 
Strain gauges and Piranhi hot wire gauges (Sec. 10.5), for example, are 
sensors that can measure pressure on solids or in gases. Pyroelectric de¬ 
tectors (Sec. 3.2), thermoelectric detectors (Sec. 8.8), thermionic devices 
(Sec. 10.2), and resistance temperature devices (Sec. 10.5) can be used to 
sense temperature changes. 

Many other energy conversion devices convert between energy stored in 
a confined volume, energy stored in a material under pressure, or energy 
in a temperature difference and another form of energy without involving 
electricity. For example, if you tie a balloon to a toy car then release the 
air in the balloon, the toy car will move forward. Energy stored in the 
confined volume of the balloon, as well as in the stretched rubber of the 
balloon, is converted to kinetic energy of the toy car. An aerator or squirt 
bottle converts energy of a pressure difference to kinetic energy of a liquid. 
An eye dropper converts energy of a pressure difference to gravitational 
potential energy. An airfoil converts a pressure difference to kinetic energy 
in the form of lift. A piston converts energy of a gas under pressure to 
kinetic energy. As discussed in Sec. 10.6, a constricted pipe, or a weir, 
converts energy of a pressure difference in a flowing liquid to kinetic energy 
of the liquid. A baseball thrown as a curve ball converts the rotational 
energy of the rotating ball into a pressure differential to deflect the ball’s 
path [162, p. 350]. A Sterling engine converts a temperature difference to 
kinetic energy. 

Calculus of variations can be used to gain insights into thermodynamic 
energy conversion processes in these devices. The first step in applying 
the ideas of calculus of variations is to identify an initial and final form 
of energy. The Lagrangian is the difference between these forms of energy 
as a function of time. Some authors choose the Lagrangian as an entropy 
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Energy 

storage 

device 

A balloon 
filled with 
air confined 
to a finite 
volume 

A 

compressed 

piston 

A cup of 
hot liquid 
(hot 

compared 
to the temp 
of the 
room) 

A container 
with two 
pure gases 
separated 
by a barrier 

Generalized 

Path 

Volume V 
in m^ 

Pressure P 
in Pa 

Temperature 
T in K 

Entropy S 

in J 

Generalized 

Potential 

Pressure P 
in Pa = 

Volume V 

1 

m m = ^ 

Entropy S 

in J 

Temperature 
T in K 

Generalized 

Gapacity 

Y = in 

B ap 

m® 

.1 

B 9P • 

V dY 

J 

m® 

PI = M in 

Cv dT 

g.K^ 

.1 

Cv _ dS • 

T dT 

J 

Gonstitutive 

relation¬ 

ship 

AV = 

-lAP 

AP = 

-|AV 

AT = ^AS 

AS = 
c^AT 

Energy (int 
expression) 

Energy 

(const. 

potential) 

/VdP 

VAP 

/PdV 

PAV 

jTdS 

TAS 

J SdT 

SAT 

Law for 
potential 

Bernoulli’s 

Equation 


Second Law 
of Thermo¬ 
dynamics 


This 

column 

assumes 

constant 

S,T 

constant 

S,T 

constant 

P,V 

constant 

P,V 


Table 12.7: Describing thermodynamic systems in the language of calculus 
of variations. 
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instead of an energy [170] [171], but throughout this text Lagrangian is 
assumed to represent an energy as described in Ch. 11. 

Assume that only one energy conversion process occurs in a device. Also 
assume that if we know three (not two) of the four thermodynamic param¬ 
eters, we can calculate the fourth. Additionally, assume small amounts 
of energy are involved, and the energy conversion process occurs in the 
presence of a large external thermodynamic reservoir of energy. 

As with the discussion of the previous tables, each column of Table 
12.7 details the parameters of calculus of variations for a different choice 
of generalized path. In order, the columns can be used to describe energy 
storage in a gas confined to a finite volume, a material under pressure, a 
temperature differential, or an ordered system. The rows are labeled in 
the same way as in the previous tables of this chapter so that analogies 
between the systems can be drawn. 

Energy can be stored and released from a gas confined to a finite volume 
and a gas under pressure. These related energy conversion processes are 
detailed in the second and third columns of Table 12.7 respectively. The 
second column specifies parameters of calculus of variations with volume 
chosen as the generalized path and pressure as the generalized potential. 
The third column specifies parameters with pressure chosen as the gen¬ 
eralized path and volume as the generalized potential. In reality, energy 
conversion processes involving changes in the pressure and volume of a gas 
are unlikely to occur without a change in temperature or entropy of the 
system simultaneously occurring. Resistive heating, friction, gravity, and 
all other energy conversion processes that could simultaneously occur are 
ignored. Temperature and entropy are explicitly assumed to remain fixed, 
and these assumptions are listed in the last row of the table for empha¬ 
sis. These columns can apply to energy conversion in liquids and solids in 
addition to gases. Using the choice of variables in the second column, the 
capacity to store energy is given by ^ where B is the bulk modulus in units 
pascals, and it is a measure of the ability of a compressed material to store 
energy [103]. Bulk modulus was introduced in Section 8.2. Using volume as 
the generalized path, the Euler-Lagrange equation can be set up and solved 
for the equation of motion. All terms of the resulting equation of motion 
have the units of pressure, and the equation of motion is a statement of 
Bernoulli’s equation, an idea discussed in Section 10.6. 

The fourth and fifth columns of Table 12.7 specify parameters of calcu¬ 
lus of variations with temperature and entropy chosen as the generalized 
path respectively. A cup of hot liquid stores energy. Similarly, a container 
with two pure gases separated by a barrier stores energy. The system is 
in a more ordered state before the barrier is removed than after, and it 
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would take energy to restore the system to the ordered state. Both of these 
systems can be described by the language of calculus of variations. As de¬ 
tailed in the fourth column, temperature can be chosen as the generalized 
path and entropy can be chosen as the generalized potential. Alternatively 
as detailed in the fifth column, entropy can be chosen as the generalized 
path and temperature can be chosen as the generalized potential. Both 
of these columns assume that the pressure and volume remain constant. 
The quantity which shows up in these columns, is the specific heat at 

constant volume in units and it was introduced in Sec. 8.3. 

g.K’ 

The equation of motion that results when temperature is chosen as the 
path and entropy is chosen as the generalized potential is a statement of 
conservation of entropy, and each term of this equation has the units of 
entropy. This relationship is more commonly known as the second law of 
thermodynamics, and it shows up in the second to last row of Table 12.7. 
More commonly, the law is written for a closed system as [109, p. 236], 






( 12 . 22 ) 


In words, it says the change in entropy within a control mass is equal to 
the sum of the entropy out of the control mass due to heat transfer plus 
the entropy produced by the system. 


(change in entropy) = (entropy out due to heat) + (entropy produced) 


A system can become more organized or more disordered, so AR may 
be positive or negative. If energy is supplied in or out, entropy can be 
transfered in or out of a system, so the quantity / ^ may be positive or 
negative. 

Energy is fisted in the third to last row of Table 12.7 in two different 
forms. The first expression is an integral expression. Eor example, you 
can integrate the volume with respect to pressure to find the energy of a 
system. 


E 


(12.23) 


Alternatively, the second expression 


AE = VAP 


(12.24) 


can be used to find change in energy in the case when volume is not a 
strong function of pressure over a small element. 
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12.5 Chemical Energy Conversion 

Batteries and fuel cells store energy in the chemical bonds of atoms. These 
devices were studied in Chapter 9. Table 12.8 details how to describe 
the physics of these chemical energy storage devices using the language of 
calculus of variations. 

Sometimes chemists discuss macroscopic systems and describe charge 

C 

distribution in a material by charge density pch in units In other cases, 
chemists study microscopic systems, where they are more interested in the 
number of electrons N and the distribution of these electrons around an 
atom. The second and third columns of Table 12.8 specify how to describe 
the macroscopic systems in the language of calculus of variations while the 
last two columns specify how to describe the microscopic systems. 

In the second column of Table 12.8, the generalized path is pch and the 
generalized potential is the redox potential Vrp in volts. There is a close 
relationship between the choice of variables specified in the second column 
of Table 12.8 and the choices specified in the second columns of Table 12.1 
and 12.3. More specifically, the generalized path described in the second 
column of Table 12.1 is charge Q in coulombs, where charge is the integral 
of the charge density with respect to volume. 

Q = J Pchd'V 

The generalized path described in the second column of Table 12.3 is dis¬ 
placement flux density ^ in units In the third column of Table 12.8, 
the opposite choice is made with Vrp for the generalized path and pch for 
the generalized potential. In Chapter 13, we consider a calculus of varia¬ 
tions problem with this choice of variables in more detail to solve for the 
electron density around an atom. 

Another way to apply the language of calculus of variations to chemi¬ 
cal energy storage systems is to choose the number of electrons N as the 
generalized path and the chemical potential Pchem as the generalized po¬ 
tential [172]. This situation is described in the fourth column of the Table 
12.8. We could instead choose pchem as the generalized path and N as the 
generalized potential, and this situation is detailed in the last column of 
Table 12.8. Reference [172] details using calculus of variations with this 
choice of variables. Chemical potential is also known as the Fermi energy 
at T = 0 K, and it was discussed in Sections 6.3 and 9.2.3. It represents 
the average between the highest occupied and lowest unoccupied energy 
levels. The quantity Eg, which shows up in the fourth row of the table, is 
the energy gap in joules. 


(12.25) 
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Energy 

storage 

device 

Battery, 
fuel cell 

Battery, 
fuel cell 

Battery, 
fuel cell, 
chemical 
bonds of an 

atom 

Battery, 
fuel cell, 
chemical 
bonds of an 

atom 

Generalized 

Path 

Gharge 

density p^h 

in^ 

nP 

Redox 

potential 

(voltage) 

Vrp in volts 

Number of 
electrons N 

Ghemical 

potential 

P'chem, m 

J 

atom 

Generalized 

Potential 

Redox 

potential 

(voltage) 

Vrp in volts 

Gharge 

Density pch 

in^ 

hP 

Ghemical 

potential 

pchem m 

J 

atom 

Number of 
electrons N 

Generalized 

Gapacity 

Gapacitance 
C in farads 

1 

c 

Inverse of 
energy gap 

1 dN 

Eg 

Energy gap 

(9/Xc/fem 

ON 

in J 

Gonstitutive 

relation¬ 

ship 

/ Pchdy = 
CVrp 

Vrp 

^ / PchdN 

AN = 

^l^chem 

EgAN 

Energy 

Jy Pch^rpd^ 

Jy PchVrpd^ 

^ f^chem 

^ f^chem 

Law for 
potential 

Nernst eq. 
(KVL) 

Gonservation 
of Gharge 

Nernst eq. 
(KVL) 

Gonservation 
of Gharge 

This 

column 

assumes 

no variation 
in 0 or 0 

no variation 
in 0 or 0 

no variation 
in 6^ or 0 

no variation 
in 0 or 0 


Table 12.8: Describing chemical systems in the language of calculus of 
variations. 
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12.6 Problems 


12.6 Problems 


12.1. Match each device, or device component, with the material or mate¬ 
rials it is often made from. 


Device or device 
component 


Material 

1. Photovoltaic device 


A. Lead zirconium titanate 

2. Piezoelectric device 


B. Bismuth telluride 

3. Battery anode 


C. Cadmium telluride 

4. Thermoelectric device 


D. Mica, Quartz 

5. Dielectric between 
capacitor plates 


E. Zinc, Lithium 


12.2. For each device or device component listed in the problem above, 
indicate whether it is typically made from a conductor, dielectric, or 
semiconductor. 

12.3. For each of the devices below, list a material that the device is com¬ 
monly made from. 

• Photovoltaic Device 

• Hall Effect Device 

• Piezoelectric Device 

• Capacitor 

12.4. Appendix B lists multiple units along with whether or not they are 
SI base units. The joule, volt, and pascal are all SI derived units. 
Express each of these units in terms of SI base units. 
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12.5. Match the effect with the definition. 


1. When an optical field is applied to a 
dielectric material, a material polarization 
develops in the material. 


A. 

Hall effect 

2. When an optical field is applied to a 


B. 

semiconductor junction, a voltage develops 


Electro- optic 

across the junction. 


effect 

3. When a current passes through a uniform 


C. 

material which has a temperature gradient. 


Photovoltaic 

heating or cooling will occur 


effect 

4. When a mechanical stress is applied to a 


D. 

dielectric material, a material polarization 
develops in the material. 


Seebeck effect 

5. When the different sides of a junction are 


E. 

held at different temperatures, a voltage 


Piezoelectric 

develops across the junction. 


effect 

6. When an external magnetic field is 


F. 

applied across a conductor or semiconductor 


Thomson 

with current fiowing through it, a charge 
builds up perpendicular to the current and 
external magnetic field. 


effect 
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12.6 Problems 


12.6. Match the device with its definition. 


1. A device which converts electromagnetic 
(often optical) energy directly to electricity 


A. 

Fuel Cell 

2 . A device made from diodes of two 


B. 

dissimilar materials which converts a 


Photovoltaic 

temperature differential to electricity 


Device 

3. A device which converts chemical energy 


C. 

to electrical energy through the oxidation of 


Piezoelectric 

a fuel 


Device 

4 . A device which converts mechanical stress 


D. 

directly to electricity 


Pyroelectric 

Device 

5. A device made from a crystal without a 


E. Thermo¬ 

center of symmetry which converts a 


electric 

temperature differential to electricity 


Device 
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13 Thomas Fermi Analysis 

13.1 Introduction 

Where are the electrons found around an atom? This question is difficult for 
a few reasons. First, at temperatures above absolute zero, electrons are in 
continual motion. Second, the Heisenberg uncertainty principle tells us that 
we can never know the position and momentum of electrons simultaneously 
with complete accuracy. However, this question isn’t hopeless. We can 
find the charge density pch which tells us, statistically on average, where 
the electrons are most likely to be found. Understanding the distribution 
of electrons in a material is vital to understanding the chemical properties, 
such as the strength of chemical bonds, as well as the electrical properties, 
such as how much energy is required to remove electrons. 

To answer this question, we will use calculus of variations. The gener¬ 
alized path will be voltage V, and the generalized potential will be charge 
density pch- A Lagrangian describes an energy difference, and the La- 
grangian will have the form 


( 13 , 1 ) 

The path found in nature is the one that minimizes the action. 

rr2 

6 Cdr = 0 (13.2) 

J ri 

In this problem, the independent variable is position, not time. We will 
set up the Euler-Lagrange equation then solve it to find the equation of 
motion. 

Most of this chapter consists of a derivation of the resulting equation 
of motion called the Thomas Fermi equation. With a bit of algebra, we 
can find both the voltage and the charge density around the atom from the 
solution to the Thomas Fermi equation. The procedure is as follows. 


• Describe the first form of energy, Ecouiomb e nud + Ee e interact , in 
terms of path V. The resulting energy density is 


E 


Coulomb e nucl 


V 


+ 


Ee 


e interact 


V 



(13.3) 


where e represents permittivity and V represents volume. 



292 


13.1 Introduction 


• Describe the second form of energy Ekinetic e in terms of path V. The 
resulting energy density is 

Ekinetic e _ ^^^ 5/2 

where cq is a constant. This step will require the idea of reciprocal 
space. 

• Write down the Hamiltonian E[ (r, V, and Lagrangian £ (r, V, . 

• Set up the Euler-Lagrange equation. 


• Solve the Euler-Lagrange equation for the equation of motion. The 
result is 

- eV^E = 0. (13.6) 


• Change variables to clean up the equation of motion. The resulting 
equation is called the Thomas Fermi equation. 


A ^ t-l/2 3/2 

dt^ ^ 


(13.7) 


• Voltage and charge density are algebraically related to the quantity 
y in the equation above. 

To attempt to find charge density and voltage as a function of position 
r from the center of the atom, we will have to make some rather drastic 
assumptions. This analysis follows works of Thomas [173] and Eermi [174] 
which were originally completed around 1927. This derivation is discussed 
by numerous other authors as well [6] [46] [136] [175]. Because of the severe 
assumptions made below, the results will not be very accurate. However, 
more accurate numerical calculations are based on improved versions of the 
techniques established by Thomas and Eermi. We are discussing the most 
simplified version of the derivation, but this is the basis of more accurate 
approaches. 
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13.2 Preliminary Ideas 

13.2.1 Derivatives and Integrals of Vectors in Spherical Coordi¬ 
nates 


The derivation of the Thomas Fermi equation involves derivatives of vectors 
in spherical coordinates. For more details on derivatives and vectors see 
[11, ch. 1], Consider a scalar function described in spherical coordinates, 

V = V{^) = V{r,e,(j)). (13.8) 


The gradient of V (r, 9, 0) is defined 


'^V 


dV ^ IdV ^ 1 dV ^ 

--^ 

or r 09 r sm 9 Ocp 


(13.9) 


Gradient was introduced in Section 1.6.1. It returns a vector which points 
in the direction of largest change in the function. The Laplacian is defined 
in spherical coordinates as 


1 d ( 2<9V\ 1 d ( . dV\ 1 

2 (9r \ (9r / sin 9 89 \ 89 ) sin^ 9 8<p‘^ 


Qualitatively, the Laplacian of a scalar is the second derivative with re¬ 
spect to spatial position. In the derivations of this chapter, we encounter 
only functions which are uniform with respect to 9 and 0. For functions 
of the form V = V{r), the formulas for gradient and Laplacian simplify 
significantly. 


dV ^ 

VV = —ar 
8r 


1 8 


r" — 


rjrt‘2i 


,8V 


8r 


We will also need the vector identity of Eq. 1.10, 


(13.11) 

(13.12) 


(13.13) 


A differential volume element in spherical coordinates is given by 

dV = r^sind dr d9 dcj). (13.14) 


A volume integral of the function V{r,9,(l)) over a sphere of radius 1 cen¬ 
tered at the origin is denoted 



r-27r 


V (r, 9, (fyr"^ sin 9 dr d9 dcf). 


i=0 


(13.15) 
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Assuming V doesn’t depend on 0 or <p, the integral is separable. 

•27V \ 

sin 9 d9 d(l)\ / V{r)r‘^dr = Att / V{r)r‘^dr (13.16) 
1=0 / J r=0 J r=0 

A sphere of radius r has volume fvrr^. 

13.2.2 Notation 

Writing this text without overloading variables has been a challenge. For 
example, V is the logical choice for denoting voltage, volume, and velocity. 
Up until now, the context offered clues to the meaning of symbols. However 
in this chapter, we will encounter equations involving both energy and 
electric field, equations involving both voltage and volume, and equations 
involving both mass and momentum. To help avoid confusion from the 
notation. Table 13.1 shows an excerpt of the variable fist from Appendix 
A. This table does not fist all quantities we will encounter. However, it 
highlights some of the more confusing ones. 

In this chapter, we will encounter many quantities which vary with po¬ 
sition. We will not encounter any quantities which vary with time. There¬ 
fore, voltage is denoted by a capital letter, not a lowercase letter. Voltage 
is a function of r, which denotes position in spherical coordinates. As¬ 
sume that the origin of the coordinate system is at the center of the atom 
under consideration. Voltage is always specified with respect to some ref¬ 
erence level called ground, so assume this zero volt reference level occurs 
at r = cx). Also assume there is no 6* or 0 dependence of the voltage. 
Therefore, U(T^) = V{r) represents voltage. 
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Symbol 

Quantity 

SI Units 

s/v/c 

Comments 

E 

Energy 

J = Nm 

s 



Electric field 
intensity 

V 

m 

V 


Ef 

Eermi energy level 

J 

s 

Also called 

Eermi level 

k 

Wave vector 

m“^ 

V 


kf 

Eermi wave vector 

m“^ 

s 


m 

Mass 

kg 

s 


M 

Generalized 

momentum 

* 

s 

Many authors 
use p 


Momentum 

kg-m 

S 

V 

Many authors 
use jf' 

N 

(Total) number of 
electrons per atom 

electrons 

atom 

s 


V 

Voltage (AC or 
time varying) 

V 

s 


V 

Velocity 

m 

s 

V 


V 

Voltage (DC) 

V 

s 


V 

Volume 

m'^ 

s 


l^chem 

Chemical potential 

J 

atom 

s 


Pch 

Charge density 

C 

m3 

s 



Table 13.1: Variable list. 
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13.2.3 Reciprocal Space Concepts 

The idea of reciprocal space was introduced in Section 6.4 in the context 
of crystalline materials. We can describe the location of atoms in a crystal, 
for example, as a function of position where position is measured in 
meters. In this chapter, we are interested in individual atoms instead of 
crystals composed of many atoms. We can plot quantities like energy E{~^) 
or voltage V{1^) as a function of position. Figure 6.11, for example, plots 
energy versus position inside a diode. In Section 6.4, the idea of wave 
vector k in units of m“^ was introduced. Wave vector represents the 
spatial frequency. We saw that we could plot energy or other quantities 
as a function of wave vector, and Fig. 6.8 is an example of such a plot. 
We will need the idea of wave vector in this chapter because we describe a 
situation where we do not know how the energy varies with position, but 
we do know something about how the energy varies with wave vector. 

13.3 Derivation of the Lagrangian 

The purpose of this chapter is to find the voltage V{r) and the charge 
density Pchij') around an atom, and we will use calculus of variations to 
accomplish this task. We need to make some rather severe assumptions to 
make this problem manageable. Consider an isolated neutral atom with 
many electrons around it. Assume T ^ 0 K, so all electrons occupy the 
lowest possible energy levels. Assume the atom is spherically symmetric. 
All of the quantities we encounter, such as voltage, charge density, and 
Lagrangian, vary with r but do not vary with 9 or 0. We will use spherical 
coordinates with the origin at the nucleus of the atom. While quantities 
vary with position, assume no quantities vary with time. The charge density 
Pchij") tells us where the electrons are most likely on average to be found. 
It is related to the quantum mechanical wave function, ip, by 

pch = -q-\'P? (13.17) 

where q is the magnitude of the charge of an electron. Assume that all of 
the electrons surrounding the atom are distributed uniformly and can be 
treated as if they were a uniform electron cloud of some charge density. 

Pick one of the electrons of the atom, and consider what happens when 
the electron is moved radially in and out. Figure 13.1 illustrates this sit¬ 
uation. As the electron moves, energy conversion occurs. The goal of this 
section is to write down the Hamiltonian and Lagrangian for this energy 
conversion process. We write these quantities in the units of energy per 
unit volume per valence electron under consideration. 
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Figure 13.1: Illustration of an atom. 


To understand what happens when the electron is moved, consider the 
energy of the atom in more detail. Coulomb’s law, introduced in Eq. 
1.4, tells us that charged objects exert forces on other charged objects. 
More specifically, the electric field intensity ^ due to a point charge of Q 
coulombs a distance r away surrounded by a material with permittivity e 
is given by 




Qcir 


(13.18) 


The atom is composed of N positively charged protons. The electron under 
consideration feels an attractive Coulomb force due to these protons. Ad¬ 
ditionally, the atom has N electrons, and iV — 1 of these exert a repulsive 
Coulomb force on the electron under consideration. Since a charge sepa¬ 
ration and electric field exist, energy is stored. Call the component of the 
energy of the atom due to the Coulomb interaction between the protons 
of the nucleus and the electron under consideration Ecouiomt e nuci- Call 
the Coulomb interaction between the electron under consideration and all 
other electrons e interact- The atom also has kinetic energy. Call the 
kinetic energy of the nucleus E^ineUc nuci and the kinetic energy of all of the 
electrons Ekinetic e- The energy of the atom is the sum of all of these terms. 


E. 


atom 


= E, 


Coulomb e nuci. 


+ E, 


kinetic nuci 


+ E, 


e e interact 


+ E, 


kinetic e 


(13.19) 


Energy due to spin of the electrons and protons is ignored as is energy due 
to interaction with any other nearby charged objects. At T ^ 0 K, the 
kinetic energy of the nucleus will be close to zero, so we can ignore the 
term, Ekinetic nud ~ 0. The quantity Ekinetic e cannot be exactly zero. In 
Chapter 6 we plotted energy level diagrams for electrons around an atom. 
Even at T = 0 K, electrons have some internal energy, and this energy is 
denoted by the energy level occupied. 
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If we have a large atom with many electrons around it, the Coulomb 
interaction between any one electron and the nucleus is shielded by the 
Coulomb interaction from all other electrons. More specifically, suppose 
we have an isolated atom with N protons in the nucleus and N electrons 
around it. If we pick one of the electrons, Ecouiomb e nuci for that electron 
describes the energy stored in the electric field due to the charge separation 
between the nucleus of positive charge Nq and that electron. However, 
there are also N — 1 other electrons which have a negative charge. The 
term e interact describes the energy stored in the electric field due to 
the charge separation between the iV — 1 other electrons and the electron 
under consideration. These terms somewhat cancel each other out because 
the electron under consideration interacts with N protons each of positive 
charge q and N — 1 electrons each of negative charge —q. However, the 
terms do not go away completely. Calculating 


E, 


Coulomb e nucl 


E, 


e e interact 


(13.20) 


is complicated because the electrons are in motion, and we do not really 
know where they are or even where they are most likely to be found. In 
fact, we are trying to solve for where they are likely to be found. 

As we move the electron under consideration in and out radially, energy 
is transferred between {^E(jQy^iQ^}j g nuci T E^ g interact) and e* The 

Hamiltonian is the sum of these two forms of energy per unit volume, 
and the Lagrangian is the difference of these two forms of energy per unit 
volume. Both quantities have the units Choose voltage V{r) as the 
generalized path and charge density Pchir) as the generalized potential. 
The independent variable of these quantities is radial position r, not time. 
We can now write the Hamiltonian and Lagrangian. 


TT ( Tr dV\ I g fiucl , Eg g interact \ , e 

= (—V—+ V j + 

^ I j T dV \ ( E(jQ^iQfYib e nuci , Eq g interact \ -^/emetic e 

—V—+ V )-^r- 

The next step is to write 


Ecoulomb e nuci ^ Eg g interact 


V 


V 


(13.21) 

(13.22) 

(13.23) 


in terms of the path V. As detailed in Table 12.3, the energy density due 
to an electric field ^ is given by 


E 

V 



(13.24) 
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Remember that E represents energy while E represents electric field. Elec¬ 
tric field is the negative gradient of the voltage V{r). 

^ = -^V. (13.25) 


We can combine these expressions and Eq. 13.13 to write the first term of 
the Hamiltonian and the Lagrangian in terms of the generalized path. 


E 


Coulomb e nucl 


V 


+ 


E 


e e interact 


V 



(13.26) 


H 

C 



+ 


kinetic e 

~Y 


kinetic e 

~Y 


(13.27) 

(13.28) 


The next task is to describe the remaining term as a function 

of the generalized path too. This task is a bit more challenging. We 
continue to take the approach of making severe approximations until it is 
manageable. We need to express Pchij') as a function of V{r). Then with 
some algebra, ^ can be written purely as a function of V{r). 

We want to generalize about the kinetic energy of the electrons. How¬ 
ever, each electron has its own velocity ^ and momentum M. These 
quantities depend on position 




TOjr -|- OttQ (jyCLfj) 


(13.29) 


in some unknown way. Eurthermore, the calculation of « depends 

on charge density Pchir), which is the unknown quantity we are trying to 
find. We have more luck by describing these quantities in reciprocal space, 
introduced in Sec. 6.4. Position is denoted in reciprocal space by a wave 
vector 

k = far + Oae + 4>a^. (13.30) 

We can describe the properties of a material by describing how they 
vary with position in real space. Eor example, Pchij') represents the charge 
density of electrons as a function of distance r from the center of the atom. 
We may be interested in how other quantities, such as the energy required 
to rip off an electron or the kinetic energy internal to an electron, vary with 
position in real space too. Instead of describing how quantities vary with 
position in real space, we can describe how quantities vary with spatial 
frequency of electrons. This is the idea behind representing quantities 
in reciprocal space. We may be interested in how the charge density of 
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electrons varies as a function of the spatial frequency of charges in a crystal 
or other material, and this is the idea represented by functions of wave 


vector such as pch ^ j- trying to solve for charge density Pchix)- 

We expect that electrons are more likely to be found at certain distances 
r from the center of the atom than at other distances. However, there is 

no pattern to the charge density as a function of wave vector, p^h ( k 

—^ 

Assume that pch is roughly constant with respect to | /c | up to some level. 
With some more work, this assumption will allow us to solve for charge 
density pch{r)- 

The kinetic energy of a single electron is given by 


E, 


kinetic e 




(13.31) 


where m is the mass of the electron. We can write this energy in terms 
of momentum, M = mV. (Note that momentum M and generalized mo¬ 
mentum M are different and have different units.) 


E, 


kinetic e 


2m 


(13.32) 


We do not know how the energy varies as a function of position r. Instead, 




we can write the energy as a function of the crystal momentum M 


-p- 


crystal 


or the wave vector k , and we know something about the variation of these 
quantities. Crystal momentum is equal to the wave vector scaled by the 
Planck constant. ^ 

crystal It k (13.33) 


id 


kgm 


It has the units of momentum —, and it was introduced in Sec. 6.4.2. 


The kinetic energy of one electron as a function of the crystal momentum 
is given by 

‘ M 

(13.34) 


E, 


kinetic e 


crystal 


h\t 


e~ 2m 2m 

A vector in reciprocal space is represented Eq. 13.30, and Eq. 13.34 can 
be simplified because we are assuming spherical symmetry 0 = 0 = 0. 
The magnitude of the wave vector becomes \ k \ = r, and we can write the 
energy as 

Ekinetic e Hr 


2m 


(13.35) 


Just as each electron has its own momentum m| V|, each electron has its 




own crystal momentum h\ k \ . However, we know some information about 
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the wave vector \ k \ oi the electrons in the atom. At T = 0 K, electrons 
occupy the lowest allowed energy states. Energy states are occupied up to 
some highest occupied state called the Fermi energy Ef. While electrical 
engineers use the term Eermi energy, chemists sometimes use the term 
ehemieal potential fichem- The lowest energy states, are occupied while the 
higher ones are empty. Similarly, wave vectors are occupied up to some 
highest occupied wave vector called the Fermi wave veetor kj. 


-^1 I filled state r < kf 
[empty state f > kf 


(13.36) 


The Eermi energy and the Eermi wave vector are related by 

Ef = - k. 13.37 

We use the idea of reciprocal space to write an expression for the kinetic 
energy of the electrons per unit volume [136, p. 49]. The kinetic energy 
due to any one electron as a function of position in reciprocal space is 
given by Eq. 13.35. Note that at each value oi \ k \ = r, the electron has 
a different kinetic energy. To find the kinetic energy per unit volume due 
to all electrons, we integrate over all | /c | = r in spherical coordinates that 
are occupied by electrons, and then we divide by the volume occupied in 
k space. 




kinetic e 


V 


vol. occupied in k space 
/filled k levels ( ^ 


_ § _ ^ 

volume J 


d (vol. all k space) 


(13.38) 


The number of electrons per unit volume is given by 

( (13.39) 
\ volume / q 

The volume occupied in reciprocal space is ^Tikj, the volume of a sphere 
of radius kf. 


E, 


kinetic e 


V 




^ 2^2 

/filled k levels V 


Pch 


d (vol. all k space) 
(13.40) 


A differential element of the volume is expressed as 
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Ehi' 


kinetic e 


V 


iTlkj 


'filled k levels 



sin 6 dr dd d(j) 


(13.42) 

As described above, electrons occupy states in reciprocal space only with 
0 <f < kf. 


dd 


kinetic e 


V 




ckf C7V c2n 

lr=0 J9=0 J6=0 


2m 


Pch 


sin 6 dr dd d- 


(13.43) 

The integral above can be evaluated directly. The first step to evaluate 
it is to pull constants outside. As described above, pch varies with r but 
not r, so it can be pulled outside the integral too. 


E, 


kinetic e 
~¥ 


—1 h^p, 


'ch 


r-27r 


2mq 7^==, 


sin 6 dr dO dcf) 


0 Je=o J<h=o 


(13.44) 


The integral separates and can be evaluated. 




kinetic e 


-1 di^Pch 


V 


dnk^r 2mq 


'•2tt 


sind dd d(j) 


dd 


kinetic e 


' e =0 J 4)=0 

-1 1^‘^Pch 


-kf 


r^dr 


I r=0 


V 


d-jrkj 2mq 


4 .|T 


E 


kinetic e 


-^pchkjff 


(13.45) 

(13.46) 

V. (13.47) 

V lOmg 

Charge density is a function of position in real space r, and we are in 
the process of solving for this function, pch{r). However, it also depends on 
the Fermi energy Ef, and hence Fermi wave vector kf, for the atom. Next, 
we find the relationship between p^h and kf. Two electrons are allowed per 
energy level (spin up and spin down), hence per filled k state. The number 
of filled states per atom in reciprocal space is related to the charge density. 


pch = -‘2q 


no. filled k states \ 
unit vol. in k space ) 


(13.48) 


In Sec. 6.4.1, we saw that a primitive cell in reciprocal space was (27r)^ 
times the primitive cell in real space, so 


(unit vol. k space) = (27r)^ ■ (unit vol. real space) = (27r)^ 


(13.49) 
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We know something about the wave vectors of filled states in reciprocal 
space. At T = 0 K, the lowest states are filled, and all others are empty, 
and they are filled up to a radius of kf. The volume of a sphere of radius 
kf is given by ^Tikj, and this represents the number of filled k states per 
volume of reciprocal space. We can therefore simplify the expression above. 




Pch 

q 


kf = 


(13.50) 

(13.51) 

(13.52) 


We want to write as a function of generalized path V. We can 

now achieve this task by combining Eqs. 13.47 and 13.52. 


Ekinetic e 

V ' 

Ekinetic e 

V 


-3fi2 

lOmg 


Pch 


.3^2 X 2/3 
Pch 

q 


—3h^ f—3n 


lOmq 


q 


2/3 


5/3 

Pch 


(13.53) 


(13.54) 


Electrical energy is the product of charge and voltage. More specifically, 
from Eq. 2.8, it is given by 

E = IqV. 


Electrical energy density is then given by 

^-1 T/ 

V 2^""^ ■ 

Use Eq. 13.56 to relate pch and V. 


Ekinetic e 1 -rr 

^ = l^Pch = 


-37r^ 

q 


2/3 


5/3 


Pch 


(13.55) 


(13.56) 


(13.57) 


We have now related the generalized path and the generalized potential. 


U = 


-3fi2 


Pch 


bmq 

-5mq 


3fi2 


-37r2 


-37r^ 


2/3 


2/3 

Pch 


-2/3' 


3/2 


U3/2 


(13.58) 

(13.59) 
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3^2 J 


3/2 


-Q 

Stt^ 


Pch 

Finally, we can write ^ as a function of V. 

3/2 


■ 1 / 3/2 


(13.60) 


F/i,,' 


kinetic e 


V 


-5mg^' 
3^2 J 


-g 

37r2 


1/3/2 


(13.61) 


Notice that the quantity in brackets above is constant. The coefficient cq 
is defined from the term in brackets. 


Co = 


-5mq^ 
3fi2 ) 


3/2 


-g 

37r2 


^kinetic e 

V 




(13.62) 


(13.63) 


We now can describe all of the terms of the Lagrangian in terms of our 
generalized path. 


Ecoulomb e nucl , F/g g interact 1 

--= -e 

V V 2 

= coV^/^ (13.65) 

The Hamiltonian represents the total energy density, and the Lagrangian 
represents the energy density difference of these forms of energy. The 
Hamiltonian and Lagrangian have the form H = Ff (r, H, ^) and C = 
C (r, V, ^) where r is position in spherical coordinates. There is no 6* or 0 
dependence of H or C. Everything is spherically symmetric. 




(13.64) 


+ (13.66) 

£ = (13.67) 

As an aside, let us consider the Fermi energy Ef = pchem once again. 
With some algebra, we can write it as a function of voltage. Use Eqs. 
13.37, 13.52, and 13.62. 


Ef = 


h^kj 

2m 


2m V g / 


(13.68) 


Ef = — 
^ 2m 


/-37r2 
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—5mq k—S tt 
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2 \ -2/3' 


3/2 


1 ^ 3/2 
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(13.69) 
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E, = -^V (13.70) 

Notice that the Fermi energy is just a scaled version of the voltage V with 
respect to a ground level at r = cx). Electrical engineers often use the word 
voltage synonymously with potential. When chemists use the term chemical 
potential, they are referring to the same quantity just scaled by a constant. 
Just as voltage is a fundamental quantity of electrical engineering that 
represents how difficult it is to move electrons around, chemical potential 
is fundamental quantity of chemistry that represents how difficult it is to 
move electrons around. 


13.4 Deriving the Thomas Fermi Equation 


As the electron around an atom moves, energy is converted between energy 
of the Coulomb interaction and kinetic energy of the electron. The action 
is 



The path found in nature minimizes the action. 


(13.71) 





= 0 


(13.72) 


The integral is over position, not time. In chapter 11, we called this idea 
the Principle of Least Action. Reference [136, p. 52] calls this idea in this 
context the Second Hohenberg-Kohn Theorem. To find the path, we set up 
and solve the Euler-Lagrange equation. The Euler-Lagrange equation in 
the case where the independent variable is a vector of the form T’ = rhr 
instead of a scalar (with no 0 or 0 dependence anywhere) is given by 


dC ( dC \ ^ 

(9(path) ^ d(path) j j 


(13.73) 


[176, p. 13]. 

As described above, generalized path is voltage V = V{r), and gener¬ 
alized potential is charge density pch = Pchir). As discussed in Chapter 
12, we could have made the opposite choice. In fact, the opposite choice 
may seem more logical because the words voltage and potential are often 
used synonymously. The same result is obtained regardless of the choice. 
However, the algebra is less messy with this choice, and this choice is more 
consistent with the literature. 
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Next, evaluate the Euler-Lagrange equation, Eq. 13.73, using the La- 
grangian of Eq. 13.67. The resulting equation is the equation of motion. 
Consider some of the pieces needed. The derivative of the Lagrangian with 
respect to the path is 


dC 

Wv 




(13.74) 


In Chapter 11, this quantity was defined as the generalized potential. 
Above, we defined pch as the generalized potential. Both and pch have 

units According to Eq. 13.60, ^ is pch multiplied by a constant, and 
that constant is close to one. Since isn’t equal to pch-, our equations 
are not completely consistent. However, the difference is small given the 
extreme assumptions made elsewhere. We also need the generalized mo¬ 
mentum. 



dV 
^ dr ' 


(13.75) 


dC ^ 

Tf)“ 



(13.76) 


Next, put these pieces into the Euler-Lagrange equation. 


^ = 0 


(13.77) 


Use Eq. 13.13. 

- eVV = 0 (13.78) 

VV = |-CoU=^/^ (13.79) 

Next, following Eermi’s original work [177], change variables 



r 


(13.80) 


where y has the units V-m. The Laplacian term on the left can be simplified 
using Eq. 13.12. 

V^U = (13.81) 


VV 


^2 



(13.82) 


VV 


dr 





(13.83) 
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1 d 




rjrt 2 


dy 


dr 


I (dy dy ^d‘^y 


VV=- 


r2 Qj. Qj. 

VV = — 

r dr"^ 


dr‘2 


Eq. 13.79 now simplifies. 


1 d'^y —5 
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= Cir-'/2|/3/2 


In tfie equation above, tfie constant is 


Cl = - —Co(-l)^/^ 
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2e 


-5mq\ 
3^2 J 


3/2 




-g 

37r2 

3/2 


■1) 


1/2 


(5mq\~' q 

\ 3h? J Stt^ 


To clean Eq. 13.89 up furtfier, cfioose 


t = c 


-2/3 


r. 


(13.84) 

(13.85) 

(13.86) 

(13.87) 

(13.88) 

(13.89) 


(13.90) 

(13.91) 

(13.92) 


Tfie variable t here is the name of the independent variable, and it does 
not represent time. It is a scaled version of the position r. 

A ^ (13.93) 

Equation 13.93 is called the Thomas Fermi equation. We have fin¬ 
ished the derivation. The Thomas Eermi equation along with appropriate 
boundary conditions can be solved for y(t). Since the equation is nonlinear, 
numerical techniques are likely used to solve it. Once y(t) is found, Eqs. 
13.56 and 13.80 can be used to find V{r) and Pchir). Erom this equation of 
motion, we can find pch {r), where, on average, the electrons are likely to be 
found as a function of distance from the nucleus in spherical coordinates. 
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13.5 Prom Thomas Fermi Theory to Density Func¬ 
tional Theory 

The analysis considered in this chapter is based on works from 1926 to 
1928 [173] [174], They were early attempts at calculating the location of 
electrons around an atom, and they were developed when the idea of an 
atom itself was still quite new. Half of Fermi’s work is in Italian, and half 
is in German. However, it is clearer than most technical papers written in 
English. 

A calculation is called ab initio if it is from first principles while a 
calculation is called semi-empirical if some experimental data is used to 
find parameters of the solution [136, p. 13]. The Thomas Fermi method is 
the simplest ab initio solution of the calculation of the charge density and 
energy of electrons in an atom [136]. Since no experimental data is used, 
the results of the calculation can be compared to experimental data from 
spectroscopic experiments to verify the results. 

We already know that the results are not very accurate because we made 
a lot of rather extreme assumptions to make this problem manageable. 
Assumptions include: 

• There is no angular dependence to energy, charge density, voltage, or 
other quantities. 

• Temperature is near absolute zero, T ^ 0 K, so that all electrons 
occupy the lowest allowed energy states. 

• There is only one isolated atom with no other charged particles around 
it. 

• The atom is not ionized and is not part of a molecule. 

• The atom has many electrons, and one electron feels effects of a uni¬ 
form cloud due to other electrons. 

• The electrons of the atom do not have any spin or internal angular 
momentum. 

Refined versions of this calculation are known as density functional theory. 
A function is a quantity that takes in a scalar value and returns a scalar 
value. A functional takes in a function and returns a scalar value. The 
name density functional theory comes from the fact that the Lagrangian 
and Hamiltonian are written as functionals of the charge density. Density 
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functional theory calculations do not make as many or as severe of assump¬ 
tions as were made above, especially for the e interact term. These calcu¬ 
lations have been used to calculate the angular dependence of the charge 
density, the allowed energy states of electrons that are part of molecules, 
the voltage felt by electrons at temperatures above absolute zero [136], and 
many other microscopic properties of atoms. Density functional theory is 
an active area of research. Often charge density is chosen as the generalized 
path instead of voltage [136]. 

Both Thomas [173] and Fermi [174] included numerical simulations. 
Amazingly, these calculations were performed way before computers were 
available! More recently, researchers have developed software packages for 
applying density functional theory to calculate the allowed energy levels, 
charge density, and so on of electrons around atoms and molecules [178] 
[179]. Because of the complexity of the calculations, parallel processing is 
used. Computers with multiple processors, supercomputers, and graphics 
cards with dozens of processors have all been used. 

13.6 Problems 


13.1. Generalized momentum is defined as 



(a) Find the generalized momentum for the system described by the 
Lagrangian of Eq. 13.67. 

(b) The generalized momentum does not have the units of momen¬ 
tum. Identify the units of this generalized momentum. 

(c) Write the Hamiltonian of Eq. 13.66 as a function of r, V, and 
M but not as a function of 

dr 

(d) Write the Lagrangian of Eq. 13.67 as a function of r, V, and M 
but not as a function of 

dr 

(e) Show that the Hamiltonian and Lagrangian found above satisfy 

the equation H = — C. 

13.2. In the analysis of this chapter, the generalized path was chosen as V 
and the generalized potential was chosen as pch- The opposite choice 
is also possible where the generalized path is pch and the generalized 
potential is V. 



310 


13.6 Problems 


(a) Write the Hamiltonian of Eq. 13.66 as functions of pch instead 
of V, so it has the form H (r, pch, ■ 

(b) Repeat the above for the Lagrangian of Eq. 13.67. 

(c) Eind the Euler-Lagrange equation using pch as the generalized 
path. 


13.3. Verify that 



is a solution of the Thomas Eermi equation [46]. 


(While this solution satisfies the Thomas Eermi equation, it is not 
useful in describing the energy of an atom. In the t —)■ 0 limit, this 
solution approaches infinity, r/(0) —)■ oo. However, in the t —)■ 0 limit, 
the solution should approach a constant, |/(0) —)■ 1, to correctly de¬ 
scribe the physical behavior of an atom [180].) 

13.4. The previous problem discussed that 


144 



is a solution of the Thomas Eermi equation. Show that 


72 



is not a solution. 

13.5. Prove that the Thomas Eermi equation is nonlinear. 
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14 Lie Analysis 

14.1 Introduction 

In Chapter 11, the ideas of calculus of variations were applied to energy 
conversion processes. We began with two forms of energy and studied how 
those forms of energy varied with variation in some generalized path and 
some generalized potential. The result was an equation of motion that de¬ 
scribed the variation of the generalized path. The equation of motion had 
the form of conservation of generalized potential. In Chapter 12, conserva¬ 
tion laws were listed in the last row of the tables. Knowing how forms of 
energy vary with path and with potential provide significant information 
about energy conversion processes. The purpose of this chapter is to show 
that we can find symmetries, invariants, and other information about the 
energy conversion process by applying Lie analysis techniques to this equa¬ 
tion of motion. If continuous symmetries of an equation can be identified, 
it is often possible to extract quite a bit of information by starting only 
with the equation. 

The equations of motion that result from calculus of variations are not 
always linear. It may or may not be possible to solve a nonlinear equation of 
motion for the path. Even in the cases where it is possible, it is often quite 
difficult because techniques for solving nonlinear differential equations are 
much less developed than techniques for linear equations. Eurthermore, 
many nonlinear differential equations do not have closed form solutions. In 
this chapter, we will see a systematic technique for getting information out 
of nonlinear differential equations that comes from calculus of variation. 
The technique is known as Lie analysis based on the work of Sophus Lie 
in the late part of the nineteenth century. Additionally, this chapter intro¬ 
duces Noether’s theorem. Using this theorem and an equation of motion, 
we may be able to derive conserved quantities. The techniques discussed 
in this chapter apply even for nonlinear equations. 

Lie analysis is a systematic procedure for identifying eontinuous sym¬ 
metries of an equation. If the equation possesses continuous symmetries, 
we may be able to find related conservation laws. Some equations possess 
multiple symmetries and conservation laws while other equations do not 
contain any symmetries or conservation laws. Using this procedure with 
a known generalized path, we may be able to derive conserved quantities 
even if we do not know how to choose the generalized potential at first. 
Some systems might even contain multiple conserved quantities, and this 
procedure will give us a complete set of conserved quantities. 

Lie analysis has been used to find continuous symmetries of many fun- 
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damental equations of physics, and it has been applied to both classical and 
quantum mechanical equations. References [164, p. 117] and [181] apply 
the procedure to the heat equation 

dy 

dt dx"^ 

describing the function y(t,x). It has been applied to both the two dimen¬ 
sional wave equation [164, p. 123] and the three dimensional wave equation 
[181]. Other equations analyzed by this procedure include Schrddinger’s 
equation [182] [183], Maxwell’s equations [184] [185], and equations of non¬ 
linear optics [186]. 

A tremendous amount of information can be gained by looking at the 
symmetries of equations. Knowledge of continuous symmetries may allow 
us to solve equations or at least reduce the order of differential equations 
[164]. If we can identify symmetries, we may be able to simplify or speed 
up numerical calculations by using known repetition in the form of the 
solution. If multiple equations contain the same symmetry elements, we 
can draw comparisons between the equations [164]. We may be able to 
find invariant quantities of the system from known continuous symmetries 
of equations. Hopefully this chapter will provide an appreciation for the 
amount of information that can be gained from applying symmetry analysis 
to equations of motion describing energy conversion processes. 

14,1,1 Assumptions and Notation 

The techniques of this chapter are applied to equations of motion that 
results from describing an energy conversion processes by calculus of vari¬ 
ations. All starting equations of motion are assumed to have only one 
independent and one dependent variable. These equations may or may 
not be linear. Furthermore, all independent and dependent variables are 
assumed to be purely real. We made the same assumptions in Chapter 
11. Most of the examples in this chapter involve second order differen¬ 
tial equations because many of the energy conversion processes studied in 
Chapter 11 led to equations of motion which were second order differential 
equations. However, these techniques apply to algebraic equations and to 
differential equations of other orders. 

In this chapter, total derivatives will be denoted as either ^ or ij. Par¬ 
tial derivatives will be denoted as either ^ or dty for shorthand. If the 
quantity y is just a function of a single independent variable, there is no 
reason to distinguish between total and partial derivatives, ^ Equa¬ 

tions of motion in this chapter will involve one independent and one de¬ 
pendent variable, y{t). However, we will encounter functionals of multiple 
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independent variables such as the Lagrangian £ = C{t,y,^). For such 
quantities, we will have to distinguish between total and partial derivatives 
carefully. 

The analysis here is in no way mathematically rigorous. Furthermore, 
the examples in this chapter are not original. References to the literature 
are included below. 

These techniques generalize to more complicated equations. They apply 
to equations with multiple independent and multiple dependent variables, 
and they apply when these variables are complex [164]. Also, these tech¬ 
niques apply to partial differential equations as well as ordinary differential 
equations, and they even apply to systems of equations [164]. See refer¬ 
ences [164] for how to generalize the methods introduced in this chapter 
to the other situations. 


14.2 Types of Symmetries 

14.2.1 Discrete versus Continuous 


This chapter is concerned with identifying symmetries of equations. We 
say that an equation contains a symmetry if the solution to the equation 
is the same both before and after a symmetry transformation is applied. 
The wave equation is given by 

^ + uly = 0 (14.2) 

where cjq is a constant. When t represents time, uq has units of frequency. 
The wave equation is invariant upon the discrete symmetry 

y ^y= -y. (14.3) 


This transformation is a symmetry because when all r/’s in the equation 
are transformed, the resulting equation contains the same solutions as the 
original equation. 


(P{-y) 

dt'^ 

d^y 

dt^ 


+ uly = 0 

( 14 , 4 ) 

+ ^l{-y) = 0 

( 14 , 5 ) 

+ ujly = 0 

( 14 , 6 ) 


Symmetries can be classified as either continuous or discrete. Contin¬ 
uous symmetries can be expressed as a sum of infinitesimally small sym¬ 
metries related by a continuous parameter. A diserete symmetry cannot 
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be written as a sum of infinitesimal transformations in this way. Three 
commonly discussed discrete symmetry transformations [187] are: 

• Time reversal t —)■ t = (—l)"t , for integer n 

• Parity y ^ y = {—l)'^y, for integer n 

• Charge conjugation y ^ y = y*, where * denotes complex conjugate. 

For example, the wave equation is invariant upon each of these three dis¬ 
crete symmetries because solutions of the equation remain the same before 
and after these symmetry transformations are performed. The transfor¬ 
mation t i = t + e, where e is the continuous parameter which can be 
infinitesimally small, is an example of a continuous transformation because 
it can be separated into a sum of infinitesimal symmetries. Both discrete 
and continuous symmetries may involve transformations of the independent 
variable, the dependent variable, or both variables. In this chapter, we will 
study a systematic procedure for identifying continuous symmetries of an 
equation, and we will not consider discrete symmetries further. 

14.2.2 Regular versus Dynamical 

Continuous symmetries can be classified as regular or dynamical. Regular 
continuous symmetries involve transformations of the independent variables 
and dependent variables. Dynamical symmetries involve transformations 
of the independent variables, dependent variables, and the derivatives of 
the dependent variables [188]. (Some authors use the term generalized 
symmetries instead of dynamical symmetries [164, p. 289].) Only regular 
symmetries will be considered. The techniques discussed here generalize to 
dynamical symmetries [164], but they are beyond the scope of this text. 

14.2.3 Geometrical versus Nongeometrical 

Symmetries may also be classified as geometrical or nongeometrical [184] 
[185]. Nongeometrical symmetry transformations involve taking a Fourier 
transform, performing some transformation of the variables, then taking an 
inverse Fourier transform. The resulting transformations are symmetries 
if the solution of the equation under consideration are the same before 
and after the transformations occur. Nongeometrical symmetries can be 
written as functions of an infinitesimal parameter but are not continuous. 
Nongeometrical symmetries will not be discussed here and are also beyond 
the scope of this text. 
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14.3 Continuous Symmetries and Infinitesimal Gener¬ 
ators 

14.3.1 Definition of Infinitesimal Generator 

Symmetry transformations can be described as transformations of the inde¬ 
pendent and dependent variables. Continuous symmetry transformations 
can be described as transformations of these variables which depend on a, 
possibly infinitesimal, parameter e. 

t^i= F{e)t (14.7) 

y^y = F{e)y (14.8) 

The operator F{e) describes the transformation. It is a function of the 
infinitesimal parameter e, and it may also depend on t and y. Furthermore, 
it is an operator meaning that it may involve derivative operations. 

We are considering only continuous symmetries, so we can study the 
behavior in the limit as e —)■ 0. The operator F{e) can be written as a 
Taylor series in the small parameter e. 

F{e) = l + eU + + ... (14.9) 

The term U in the expansion above is called the infinitesimal generator. It 
may be separated into two components. 

U = ^dt + vdy (14.10) 

The function ^ describes infinitesimal variation in the independent variable. 
The function y describes infinitesimal variation in the dependent variable, 
and it was introduced in Sec. 11.4. Both ^ and y may depend on both the 
independent variable and the dependent variable. 

e = e(tw) (14.11) 

y = y{t,y) (14.12) 

In the limit of e —)■ 0, we can ignore terms of order or higher. 

F{e)'^lFeU (14.13) 

An infinitesimal generator describes a continuous symmetry transforma¬ 

tion. If we know an infinitesimal generator for some continuous symmetry, 
we can find the corresponding transformation 

y e^^y. 


t —)■ e^^t and 


(14.14) 
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To understand where this relationship between infinitesimal generators and 
finite transformations come from, consider the Taylor expansion of [14, 
p. 33], 

= 1 + + 1 ^eUf + ^ {eUf + ... (14.15) 

In the limit as e —)■ 0, 

e^^^l + eU. (14.16) 

Therefore, the corresponding infinitesimal transformation for e —)■ 0 is given 

by 

t^t{l + eCj (14.17) 

y^yil+erf). (14.18) 

14,3.2 Infinitesimal Generators of the Wave Equation 

As an example, consider infinitesimal generators of the wave equation 

-^+ujly = Q. (14.19) 

As mentioned above, the wave equation contains a continuous symmetry 
of the form t —)■ t + e. This continuous symmetry transformation has the 
form 

t—)-t(l + e.^) and y ^ y {1 + er]) (14.20) 

with .^ = 1 and ?7 = 0. It can be described by the infinitesimal generator 

U = ^dt + Vdy = df (14.21) 

More generally, infinitesimal generators and finite transformations are re¬ 
lated by Eq. 14.14, so finite transformations can be derived from infinites¬ 
imal generators. 

t —y (e^^*) t = ^1 -f sdf -\- — {sdt) + ...^ t = t -\- s (14.22) 

y (e^^‘) y= (^1 + ^<9* + ^ + ...^ y = y. (14.23) 

While the symmetry transformation was given in this example, below we 
will see a procedure to derive infinitesimal generators for an equation. 

In general, if we know a solution to an equation and we know that a 
symmetry is present, we can derive a whole family of related solutions to 
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Figure 14.1: The solid line shows a solution to the wave equation. The 
dotted and dashed lines show solutions found using the symmetry trans¬ 
formation t ^ t + e and y ^ y which has infinitesimal generator U = dt. 

the equation without having to go through the work of solving the equation 
again. The wave equation, Eq. 14.19, has solutions of the form 

y(t) = Co cos {uot) + Cl sin {uot) (14.24) 

where boundary conditions determine the constants Cq and Ci. The sym¬ 
metry described by the infinitesimal generator U = dt tells us that 

y{t) = Co cos (cco (t + £)) + Cl sin (tco (t + e)) (14.25) 

must also be a solution. Using Eq. 14.24, we have found a family of 
related solutions because Eq. 14.25 is a solution for all finite or infinitesimal 
constants e. Figure 14.1 illustrates this idea. The known solution is shown 
as a solid line. The dotted and dashed lines illustrate related solutions, 
for different constant c values. We encountered the wave equation in the 
mass spring example of Section 11.5 and the capacitor inductor example 
of Section 11.6, so symmetry analysis provides information about both of 
these energy conversion processes. It tells us that if we run the energy 
conversion process and find one physical path y(t), then for appropriate 
boundary conditions, y(t + e) is also a physical path. This symmetry is 
present in all time invariant systems. Qualitatively for the mass spring 
example, it tells us that if we know the path taken by the mass when we 
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Figure 14.2: The solid line shows a solution to the wave equation. The 
dotted and dashed lines show solutions found using the symmetry transfor¬ 
mation t —)■ t and y ^ y{l + e) which has infinitesimal generator U = ydy. 

remove the restraint today, then we know the path taken by the mass when 
we repeat the experiment tomorrow , and we know this idea from symmetry 
analysis without having to re-analyze the system. 

All linear equations, including the wave equation, contain a continuous 
symmetry transformation described by the infinitesimal generator 

U = ydy (14.26) 

which corresponds to .^ = 0 and rj = y. Again, we can find the correspond¬ 
ing finite transformation using Eq. 14.14. 

t -)■ t=(^l + eydy + ^ {eydyf + ...^ t = t (14.27) 

and 

y 1 / = 1^1 + eydy + ^ {eydyf + ...^ y = y{l + e). (14.28) 

To summarize this transformation. 


t —}■ t 


( 14 . 29 ) 
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and 

y^y{l + e) (14.30) 

The above transformation says that if we scale any solution of a linear 
equation, y{t), by a constant (1 + e), the result will also be a solution of 
the equation. By definition, a linear equation obeys exactly this property. 
By knowing a solution of the wave equation and this symmetry, we can find 
a whole family of related solutions, and this family of solutions is illustrated 
in Figure 14.2. 

The wave equation also contains the symmetry transformation described 
by the infinitesimal generator 


U = sin(a;ot)(9y. 


(14.31) 


The operators ^ and rj can be identified directly from the infinitesimal 
generator. 

e = 0 (14.32) 

T] = sin(a;ot) (14.33) 

Again we can find the corresponding finite transformations using Eq. 14.14. 

t = t (14.34) 

y -)■ = ^1 + esm {uot) <9?; + ^ (^sin {ujot) dyf + ...^ y (14.35) 


y ^ y + e sin {ojot) (14.36) 

If we know a solution y{t) to the wave equation, this transformation tells us 
that y{t) +esin(a;ot) is also a solution. Since e can be any infinitesimal or 
finite constant, we have found another family of solutions using symmetry 
concepts, and these solutions are illustrated in Figure 14.3. 

In this section, we have discussed three of the symmetries of the wave 
equation. The wave equation actually contains eight continuous symme¬ 
try transformations. Deriving these transformations is left as a homework 
problem. 
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Figure 14.3: The solid line shows a solution to the wave equation. The 
dotted and dashed lines show solutions found using the symmetry trans¬ 
formation t —)■ t and y ^ y + e sin (cjot) which has infinitesimal generator 
U = sin (cjoi) dy. 

14.3.3 Concepts of Group Theory 

The study of symmetries of equations falls under a branch of mathematics 
called group theory. When mathematicians use the word group, they have 
something specific in mind. A group is a set of elements along with an 
operation that combines two elements. The operation is called group mul¬ 
tiplication, but it may or may not be the familiar multiplication operation 
from arithmetic. To be a group, the elements and operation must obey four 
additional properties: identity, inverse, associativity, and closure [14, p. 7] 
[164, p. 14]. A group is called a Lie group if all elements are continuously 
differentiable [164, p. 14]. 

The first property of group elements is the identity property which says 
that every group must have an identity element, When the identity 
element is multiplied by any other group element, the result is that other 
element. The inverse property says that each element in the group must 
have a corresponding inverse element which is also in the group. When 
the original element is multiplied by its inverse, the result must be the 
identity element. The associative property says that the product of group 
elements {Xi ■ X 2 ) ■ X^ where the first two elements are multiplied first and 
the product of group elements Xi ■ {X 2 ■ X 3 ) where the last two elements 
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Group property name 

Summary of property 

Identity 

Xi ■ X,, = Xi 

Inverse 

Xi ■ Xf^ = X,, 

Associativity 

(Xi ■ X 2 ) ■ X 3 = Xi ■ (X 2 ■ X 3 ) 

Closure 

Xi ■ X 2 is an element of the group 


Table 14.1: Group properties. 


are multiplied first must be equal. 


(Xi ■ X 2 ) ■ X 3 = Xi ■ (X 2 ■ X 3 ) (14.37) 

The closure property says that when two elements of the group are mul¬ 
tiplied together, the result is another element of the group. Table 14.1 
summarizes these properties where Xi, X 2 , and X 3 are elements of the 
group, and is the identity element which is also a member of the group. 
However, groups may have more or less than three elements. 

In general, the order in which group elements are multiplied matters. 

Xi-X2^X2-Xi. (14.38) 

The quantity Xi ■ X 2 ■ ■ X^^ is sometimes called the commutator, 

and it is denoted [Xi,X 2 ]. Due to the closure property, the result of the 
commutator is guaranteed to be another element of the group [14, p. 21,32] 
[164, p. 39,50]. 

[Xi, X 2 ] = Xi ■ X 2 ■ Xf' ■ X 2-1 (14.39) 

Continuous symmetries of equations are described by infinitesimal gen¬ 
erators that form a Lie group. The elements of the group are the infinitesi¬ 
mal generators scaled by a constant [164, p. 52]. The group multiplication 
operation is regular multiplication also possibly scaled by a constant. Ac¬ 
cording to this definition, U = dt, U = 2dt and U = —lt).2dt are all the 
same element of the group because the constant does not affect the element. 
If we find a few infinitesimal generators of a group, we may be able to use 
Eq. 14.39 to find more generators. A complete set of infinitesimal gener¬ 
ators describe all possible continuous (regular geometrical) symmetries of 
the equation. All continuous (regular geometrical) symmetry operations of 
the equation can be described as linear combinations of the infinitesimal 
generators. 
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14.4 Derivation of the Infinitesimal Generators 

14.4.1 Procedure to Find Infinitesimal Generators 

We are studying differential equations, which can be written as 

F{t,y,y,...) = 0 (14.40) 

for some function F. We are looking for continuous symmetries that can 
be applied to this equation such that the original equation and the trans¬ 
formed equation have the same solutions. The symmetries are denoted by 
infinitesimal generators 

U = idt + r,dy (14.41) 

that describe how the independent variable t and dependent variable y 
transform. Upon a symmetry transformation, the independent variable 
and dependent variable transform, but so do the derivatives of the depen¬ 
dent variable, y, y, ... The prolongation of an infinitesimal generator is a 
generalization of the infinitesimal generator that describes the transforma¬ 
tion of the independent variable, the dependent variable, and derivatives 
of the dependent variable [164, p. 94]. 

The nth prolongation of a generator U is defined as 

PtF)u = rf^^d-y + ..., (14.42) 

and it has terms involving 77 *". The functions 77 * and 77 ** are defined [164], 


w* (fww) = - iy) + iij 

(14.43) 

(U y,y) = ^{y-iy) + iy 

(14.44) 


The quantities 77 ***, 77 ****, and so on can be defined similarly, but they will 
not be needed for the examples below. The prolongation of the infinitesimal 
generator is an operator that describes the transformation of t, y, 7/, y, and 
so on up to the nth derivative. Some authors [189] use the term tangential 
mapping instead of prolongation. 

The procedure to find all possible continuous symmetries of an equation 
is based on the idea that the solutions of an equation remain unchanged 
upon a symmetry operation. For a given transformation to be a symmetry 
operation, not only must all the solutions remain unchanged, but so must 
all derivatives of the solutions. Thus, for a differential equation of the form 
F(t,y,y, ...) = 0, all symmetries U obey the symmetry eondition 
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pr(n)^^ = 0. (14.45) 

We solve this symmetry condition to find all allowed infinitesimal genera¬ 
tors that describe continuous symmetries of the original equation. 

We can use Eqs. 14.43 and 14.44 to write the symmetry condition in 
terms of the components of the infinitesimal generators, ^ and rj. Then, we 
solve the symmetry condition for ^ and rj. This step involves some algebra, 
but it can be accomplished with some patience and an adequate supply of 
ink and paper. 

We can solve the symmetry condition for the allowed infinitesimal gen¬ 
erators. By careful solution, we find all infinitesimal generators of the 
form U = ^dt + rjdy. This procedure gives us a systematic way to find all 
continuous symmetries of the equation. 

This technique applies to any differential equation. We are most inter¬ 
ested in applying it to equations of motion that describe energy conversion 
processes. Erom this technique, we get information about solutions of the 
equation even when the equation of motion is nonlinear. Eurthermore, in 
the Sec. 14.5 we see that we may be able to use the symmetries to find 
invariants of the equation, and invariants often have physical meaning. All 
symmetries of calculus of variations problems of the form S f Cdt = 0 are 
necessarily symmetries of the Euler-Lagrange equation. However, the con¬ 
verse is not necessarily true, so not all symmetries of the Euler-Lagrange 
equation are symmetries of the integral equation [164, p. 255]. 

14.4.2 Thomas Fermi Equation Example 

As an example, we apply this procedure to the Thomas Eermi equation 

y = . (14.46) 

This equation was derived in Chapter 13. Erom the solution of this equation 
r/(t), the charge density Pchij') of electrons around an isolated atom and 
the voltage V (r) felt by the electrons can be calculated within the, rather 
severe, assumptions specified in that chapter. The independent variable of 
the equation is a scaled version of radial position, not time. However, t will 
be used as the independent variable here because the procedure applies to 
equations regardless of the name of the variable. Reference [190] applies this 
procedure to a family of equations known as the Emden-Eowler equations. 
The Thomas Eermi equation is a special case of an Emden-Eowler equation, 
so the result of this example can be found in reference [190]. 
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We would like to identify continuous symmetries of Eq. 14.46. These 
symmetries will be specified by infinitesimal generators of the form 


U = idt + r]dy 


(14.47) 


where ^ and rj have the form ^(t,y) and rjit^y). Solutions of the equation 
satisfy 

{y - = 0. (14.48) 

For infinitesimal generators that describe symmetries of this equation, the 
prolongation is also zero. 


pr(n)7/(^-^3/2^-V2) ^ 0 . 


(14.49) 


(14.51) 


Eq. 14.49 can be solved for all generators V corresponding to continuous 
symmetries of the Thomas Fermi equation. Eqs. 14.42 and 14.49 can be 
combined. 

t = 0 (14,60) 

Next, Eq. 14.44 is used. 

dttV + ‘iydytri + ydyT] + y^dyyT] - 2ydt^ - ydui - 2y^dyt^ 

-y^dyyi - ?>yydyi + = 0 

Substitute the original equation for y. 

dttV + 2ydyty + y^lH-^l'^dyy + ip'dyyy - 2y'^lH-^l‘^dti - ijdtti - 
—y^dyy^ — 3yy^^H~^^'^dy^ + = 0 

(14.52) 

Regroup terms. 

{dttV + y^^H~^^‘^dyri — 2y^l‘^t~^l‘^dti + 

+y {‘^dytv - dtt^ - 3y^/H-^/^dy^) + y^ {dyyy - 2dyti) - y^ (dyyO = 0 

(14.53) 

Each of the terms in parentheses in Eq. 14.53 must be zero. 

= 0 (14,64) 


2dyty - dtti - 3y^^H = 0 

dyyTj - 2dyti = 0 


(14.55) 

(14.56) 

(14.57) 
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Eqs. 14.54, 14.55, 14.56, and 14.57 can be solved for ^ and 77 . Erom Eq. 
14.57, dyy^ = 0, so ^ must have form 

^ = {ci + C2y)b{t). (14.58) 

The quantities denoted Cn are constants. Erom Eq. 14.56, rj must have the 
form 

v= {c3 + caV + C^y^) g{t). (14.59) 

Eunctions b(t) and g(t) only depend on t, not y. The condition of Eq. 14.55 
can be rewritten. 


{2cAdtg - cidttb) + yi^c^dtg - 2c2dub) - Sy^^H ^^^C2b = 0 (14.60) 


To satisfy Eq. 14.60, C 2 must be zero, and either C 5 = 0 or g(t) = 0. 
Erom Eqs. 14.55 and 14.56, dyT] and dt^ must be constant. Therefore, the 
form of ^ must be 

^ = Cq + c-jt. (14.61) 

This form can be substituted into Eq. 14.54. 


|/3/2t 3/2 (^C4 + 2c^y) — 27/3/2^ 1/2-|- 1(^C6 + C7t)y^^H 3/2 
-|(c3 + C4y + c^y^)y^lH-^l‘^ = 0 

^3/2^-1/2 _ 2c, + 1C7 - |C4) + 

-ic3l/^/V3/2+7/5/27-1/2 (2 c5-|c5)=0 


(14.62) 

(14.63) 


The coefficients C 3 , C 5 , and cq must be zero. Also, C 4 = —3cj. No other 
solutions here are possible. Thus, the symmetry condition of Eq. 14.49 can 
be satisfied by .^ = t and y = —3y . 

This procedure finds one regular continuous infinitesimal symmetry of 
the Thomas Eermi equation, with infinitesimal symmetry generator 


U = tdt- 3ydy. 


(14.64) 


No other solutions can satisfy the constraints given by Eq. 14.49. There¬ 
fore, this equation has only one continuous symmetry. 

Einite transformations are related to infinitesimal transformations by 
Eq. 14.14. In this case, the independent variable transforms as 


t^i= 


(14.65) 


t —)■ t — 


1 -f e {tdt - 3ydy) + —{tdt - 3ydyf + ... 


(14.66) 
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t^t= t + et{dtt) + {dtt) + ... (14.67) 

t ^ i = te^ (14.68) 

The dependent variable transforms as 

y^y = e^(ta,-3ydy)y_ 

y ^y= l + e{tdt- 3ydy) + {tdt - Sydyf + ... y (14.70) 

y^y = ye-^^ (14.71) 

Defining the constant Cq = e^, the transformation can be written as 

t ^ CQt and y^{cQ)~^y. (14.72) 

The analysis above shows that the original Thomas Fermi equation of Eq. 
14.46 and the transformed equation 

{y^6^) f -3\3/2 , .-1/2 n/^'7Q^ 

,2 = [yce ) (tee) (14.73) 

d [tco] 


have the same solutions. From it, we can conclude that if y{t) is a solution 
to the Thomas Fermi equation, we know that c^^y{T) for r = c^t is also a 
solution. 

14.4.3 Line Equation Example 

Consider another example of this procedure applied to the equation i/ = 0. 
The solution of this equation can be found by inspection 

y{t) = Cot + Cl (14.74) 

because this is the equation of a straight line. The coefficients Cn are con¬ 
stants, and they are different from the previous example. In this example, 
we will identify the infinitesimal generators for continuous symmetries of 
this equation, and we will find eight infinitesimal generators. The result of 
this problem appears in [191], and it is a modified version of problem 2.26 
of reference [164, p. 180]. 

Solutions of the original equation must be the same as solutions of an 
equation transformed by a continuous symmetry, and this idea is contained 
in the symmetry condition of Eq. 14.45. In this case, the original equation 
is 1/ = 0, so the prolongation of an infinitesimal generator acting on this 
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equation must also be zero for an infinitesimal generator U that describes 
a continuous symmetry. 


= 0. (14.75) 

Using Eqs. 14.42, 14.43, and 14.44, we can write this symmetry condition 
in terms of ^ and rj. 


= 0 

= {) = dttf] + 2ydytr] + ydyT] + ij’^dyyT] - 2ydt^ 

-ydtti - 2y‘^dyti - fdyy^ - 
Use 1/ = 0, and regroup the terms. 


(14.76) 

(14.77) 


{dttv) + y {2dytV - dttO + y'^ {dyyV - 2dyti) - f {dyyO = 0 (14.78) 

The above equation is true for all y only if all of the quantities in 
parentheses are zero. 


dttV = 0 

(14.79) 

2dytr] - dtti = 0 

(14.80) 

dyyT] - 2 dyti = 0 

(14.81) 


(14.82) 

The next step is to solve the above set of equations for all possible solutions 
of S, and rj which will determine the infinitesimal generators of all possible 
continuous symmetry transformations. 

We will consider three cases: case 1 with 77 = 0, case 2 with = 0, and 
case 3 with both ^ and rj nonzero. 

Case 1 with 77 = 0 : Assume 77 = 0. What solutions can be found for 
Equation 14.79 to Eq. 14.82 can be reduced. 

du^ = 0 

(14.83) 

^yy^ ~ 0 

(14.84) 

dyt^ = 0 

(14.85) 
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There are three possible independent solutions for They are ^ = 1, ^ = t, 
and i = y. So, we found three infinitesimal generators. 


Ui = d, 

(14.86) 

U2 = tdt 

(14.87) 

Us = ydt 

(14.88) 

Case 2 with = 0: Suppose ^ = 0. What solutions can be found for rj? 
Equation 14.79 to Eq. 14.82 simplify. 

dttV = 0 

(14.89) 

dytV = 0 

(14.90) 

^yyV — 0 

(14.91) 

There are three possible independent solutions for rj. They are rj = 
and T] = t. So, we found three more infinitesimal generators. 

= y, 

U^ = dy 

(14.92) 

Us y^y 

(14.93) 

Ue = tdy 

(14.94) 

Case 3 where both ^ and y are nonzero: Erom Eq. 14.79, we can write 

7] = {ci +C2t)b{y). 

(14.95) 

Here, 6 is a function of only y, not t. Therefore, 


dyti] = C2dyb{y) 

(14.96) 

which is not a function of t. 

Erom Eq. 14.82, we can write 


^ = {cs + C4y)g{t). 

(14.97) 

Here, is a function of only t, not y. Therefore, 


dyt^ = Cidtgit) 

(14.98) 


which is not a function of y. Now use Eq. 14.80. 


2c2dyb{y) - (c3 + c^y) dug = 0 


(14.99) 
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The first term is not a function of t. Therefore, ^ is at most quadratic in 


t. So, ^ has the form 

^ = (c 3 + c^y) (c 5 + cet + . (14.100) 

Distribute out the multiplication. 

^ = C 3 C 5 + c^cet + CsCjt'^ + C4C5y + C4Ceyt + C4C7yt‘^ (14.101) 

9yt^ = C 4 C 6 + 2c4C7t (14.102) 

Next, use Eq. 14.81. 

dyyT] - 2c4dtg = 0 (14.103) 

The second term is not a function of y. Therefore, rj is at most quadratic 
in y. So, rj has the form 

7] = {ci + Cst) (cg + Cgy + Cloy‘d) . (14.104) 

Distribute out the multiplication. 

Tj = CiCg + Cl Cgy + CiCiol/^ + C2Cst + C2Cgyt + C2Cigty‘^ (14.105) 

dyty = C 2 C 9 + 2c2Cioy (14.106) 

Now use Eqs. 14.80 and 14.106. 

2 (C 2 C 9 + 2c2Cioy) - 2 ( 2 C 3 C 7 + 2yc4CY) = 0 (14.107) 

( 2 C 2 C 9 - 4 C 3 C 7 ) + y ( 4 c 2 Cio - 4 C 4 C 7 ) = 0 (14.108) 

We end up with the pair of equations 

C 2 C 9 = 2 C 3 C 7 (14.109) 

C 2 C 10 = C 4 C 7 (14.110) 

Next use Eqs. 14.81 and 14.102. 

(2ciCio + 2tc2Cio) — 2 (C 4 C 6 + 2c4C7t) = 0 (14.111) 


(2ciCio - 2 C 4 C 6 ) + t (2c2Cio - 4 C 4 C 7 ) = 0 


(14.112) 
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and we end up with a pair of equations. 


CiCio = C 4 C 6 (14.113) 

C 2 C 10 = 2 C 4 C 7 (14.114) 

These are the only possible solution of Eqs. 14.110 and 14.114. 

Finally, there are two possible solutions which are independent from the 
previously found solutions. We can set the coefficients of Eq. 14.113 to 1. 
The first solution is 77 = and ^ = yt corresponding to 

Uj = ytdt + y^dy. (14.115) 

For the second solution, we can set the coefficients of Eq. 14.109 to 1. The 
second solution is rj = yt and i corresponding to 

Us = t^dt + ytdy. (14.116) 

At this point, we have found eight infinitesimal generators. These are all 
possible generators of continuous regular nongeometrical symmetries. 

14.5 Invariants 

14.5.1 Importance of Invariants 

Noether’s theorem describes the relationship between continuous symme¬ 
tries of an equation describing an energy conversion process and invariants 
of the system. The theorem was originally discovered by Noether around 
1918 [165] [166]. The importance of this theorem is described in the intro¬ 
duction to the English translation of the original paper [165]. “The well 
known theorem of Emmy Noether plays a role of fundamental importance 
in many branches of theoretical physics. Because it provides a straightfor¬ 
ward connection between the conservation laws of a physical theory and 
the invariances of the variational integral whose Euler-Lagrange equations 
are the equations of that theory, it may be said that Noether’s theorem has 
placed the Lagrangian formulation in a position of primacy.” 

14.5.2 Noether’s Theorem 

Consider an energy conversion process with a known Lagrangian that sat¬ 
isfies an Euler-Lagrange equation. Assume that we have identified contin¬ 
uous symmetries described by infinitesimal generators. Noether’s theorem 
says that there is a relationship between these continuous symmetries and 
conservation laws which say that some quantity is invariant. We would like 
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to find the corresponding conservation laws and invariants. If we can find 
a quantity G that satisfies, 

f = pr<">C/£ + 4. (14.117) 

then the quantity 

is an invariant. For Lagrangians with units of joules, the quantity G also has 
units joules. In Eq. 14.117, pr(")f/£ is the prolongation of the infinitesimal 
generator acting on the Lagrangian where prolongation was defined in Eq. 
14.42. 

14.5.3 Derivation of Noether’s Theorem 

We can derive this form of Noether’s theorem, and this derivation closely 
follows the clear and simplified derivation in reference [192]. This theorem 
is detailed and derived more rigorously in multiple other references [163, p. 
208] [164]. For the purpose of this derivation, assume that we begin with 
an equation of motion that is at most a second order differential equation. 
However, the ideas generalize to higher order equations too. Also, assume 
we know the corresponding Lagrangian of the form C = C(t, y, y). The 
general approach is to assume that we can find a value of G defined by Eq. 
14.117. We will perform some algebra on Eq. 14.117 to show that choice 
of G necessarily implies that T is invariant. 

Use the definition of the prolongation to write Eq. 14.117 in terms of ^ 
and f]. 

= ^dt + rjdy + rfdy (14.119) 

For a second order differential equation, no more terms are needed because 
the Lagrangian depends on, at most, the first derivative y. Substitute the 
prolongation acting on the Lagrangian into Eq. 14.117. 

^ = [idtC + ydyC + y^dyC] + (14.120) 

Consider the continuous transformation described by 

t ^t= {l + ei + e^..)t (14.121) 


and 


y^y= {l + er] + e^...)y 


(14.122) 
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in the limit e —)■ 0. The Lagrangian £ (t, y, y) of an energy conversion pro¬ 
cess represents the difference between two forms of energy. The Lagrangian 
£ (t, y, y) represents the difference between two forms of energy upon the 
continuous symmetry transformation described by infinitesimal generator 
U. Qualitatively, the quantity ^ represents the change in £^ with respect 
to e in this limit [192]. 


dG 

dt 


A 

de 


^ (£ y, y) 


dt 

dt 


(14.123) 


Use Eq. 14.43 to substitute for 77 * in Eq. 14.120. 

= ^{v-^y) + ^y (14.124) 


dG 

dt 


= v-^y-& + y = v-y^ 


idtC + r]dyC + it] -yu dyC 




(14.125) 

(14.126) 


We want to express the right side as the total derivative of some quan¬ 
tity, which we call G. With some algebra, we can write this as a total 
derivative. We will use the definition of the total derivative. 


— = dtC + ydyC + ydyC (14.127) 

dtC = C - tjdyC - ydyC (14.128) 

^ ivdy^) = vdyC + 7]^ (dyC) (14.129) 

v9yC = ^ (ydyC) - (dyC) (14.130) 


^ i^ydyC) = iydyC + iydyC + Ojj^dyC (14.131) 

iy^y^ = ^ {iydyC) - iydyC - ^y^dyC ( 14 . 132 ) 

Use these pieces to replace the terms of ^ in brackets. 


dG 

dt 


dG r 

— = [^dtC] + ydyC + [ydyC] - y^dyC 






£ - ydyC - ydyC + ydyC + (ydyC) -yf^ {dyC)] 

tjidyc] +ic 


(14.133) 

(14.134) 
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dC r • 1 

— = ^ - ydyC - ydyC^ + ydyC + [| (ydyC) - [dyC)] (14.135) 

- [it - ^yfA^] + A 

Two terms cancel. 

dG 

— = A-^ydyC + ydyC+f^iydyC)-yf^idyC) (14.136) 

-i{(ydy£)+(AdA + A 

Regroup terms. 
dG 

- PA) iv - yO (14.137) 

+ [({£ + £?) + ! (>)9s£) - I (?!/9 s£) 

The first term in parentheses is zero because the Lagrangian C satisfies the 
Euler-Lagrange equation. 

^ = 4 ({£ + - (^ja.£)) (14.138) 

4 [?£ + (,,d^£) - - G| = 0 (14.139) 

Therefore, if we can find G, then the quantity in brackets T must be in¬ 
variant. 

T = -I- (ydyC) — {^ydyC) — G = invariant (14.140) 


14.5.4 Line Equation Invariants Example 

Let us apply Noether’s theorem to some examples. Eirst, consider the fine 
equation y = 0 which results from application of calculus of variations with 
Lagrangian 

C = ^y^. (14.141) 

A continuous symmetry of this equation is described by the infinitesimal 
generator U = dy with .^ = 0 and y = 1. The prolongation of the generator 
acting on the Lagrangian is zero. 


AUC = rfii = y i A] = 0 


(14.142) 


Using Eq.14.117, we see that (7 = 0. 


dt 


0 + £-0 


0 


(14.143) 
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Next use Eq. 14.118 to find the invariant. 

dC 

T = ri— = y (14.144) 

Qualitatively, if represents the slope of the line, so this invariant tells us 
that the slope of the solutions to the line equation must be constant. 

Another continuous symmetry of this equation is described by the in¬ 
finitesimal generator U = tdy with .^ = 0 and y = t. We can solve for the 
prolongation of the generator acting on the Lagrangian. 

(t - 0) + 0^ = r/ 

We can find G using Eq.14.117, and we can find the invariant using Eq.14.118. 

dG In , , 

^ = 2/+2^ -0 = 2/ (14.145) 

G = y (14.146) 

T = y-ty (14.147) 

Qualitatively, this invariant represents the y-intercept of the line, so this 
invariant tells us that the y-intercept of the solution to the line equation 
must be constant. 


pr(")f/£ = if^y = y{^ 


14.5.5 Pendulum Equation Invariants Example 

Consider the equation describing a pendulum, studied in Problem 11.8. 
The energy conversion process is described by the Lagrangian 

C =-mif'^ — mg cosy (14.148) 

which corresponds to the equation of motion 


y = gsmy. (14.149) 

In these equations m represents the mass, and g represents the gravitational 
constants. Both m and g are assumed constant here. This equation of 
motion has only one continuous symmetry described by the infinitesimal 
generator U = dt with .^ = 1 and ?7 = 0. We can use Noether’s theorem to 
find the corresponding invariant. 
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Use Eq. 14.117 to find G. 


^ = pr<"'C/£ + C^ 

at at 


dG . . . .d^ 

—— = 7] my + Tjmg sm.y + L— 
dt dt 


dG f . . f d , 

— = r^my = ym\—[y-iy)+iy 


dG f di ^ ^ , 


G = 0 

Use Eq. 14.118 to find the invariant. 


'T = yy + iC- imyy - 0 


(14.150) 

(14.151) 

(14.152) 

(14.153) 

(14.154) 

(14.155) 


'V' ^ * 2 

T = -my 


mg cos y — my 


(14.156) 


^ • 2 

i = —my 


gm cos y 


(14.157) 


The quantity T is conserved, and it is the Hamiltonian which represents 
total energy. 

Whenever the Lagrangian does not explicitly depend on t, the system 
contains the continuous symmetry described by the infinitesimal generator 
U = dt- This infinitesimal generator has = 1 g = 0. Erom Eq. 
14.117, G must be zero. Erom Eq. 14.118, the corresponding invariant has 
the form 

T = C-myy (14.158) 


which has the magnitude of the total energy (assuming t is time). This 
equation is equal to the Hamiltonian of Eq. 11.24. Therefore, if an equation 
of motion contains the symmetry (9t, energy is conserved. 
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14.6 Summary 

In this chapter, a procedure to find continuous symmetries of equations 
was presented. Also, the relationship between continuous symmetries and 
invariants, known as Noether’s theorem, was discussed. If we can describe 
an energy conversion process by a Lagrangian, we can use the techniques 
of calculus of variations detailed in Chapter 11 to find the equation of 
motion for the path. We can use the procedure discussed in this chapter to 
identify continuous symmetries of the equation of motion. These symmetry 
transformations are denoted by infinitesimal generators which describe how 
the independent and dependent variables transform. We also may be able 
to use Noether’s theorem to find invariants of the system. We can apply 
this analysis even in cases where the equation of motion is nonlinear or 
has no closed form solution. The invariants often correspond to physical 
quantities, such as energy, momentum, or angular momentum, which are 
conserved in the system. Knowledge of invariants can help us gain insights 
into what quantities change and what quantities do not change during the 
energy conversion process under study. 
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14.7 Problems 


14.1. Three commonly discussed discrete symmetry transformations are: 
Time reversal t —)■ t = (—l)"t for integer n 

Parity y ^ y = (—l)”r/ for integer n 
Charge conjugation y ^ y = y* 

Verify that the wave equation, y + UQy = 0, is invariant upon each of 
these discrete transformations. 

14.2. Repeat the problem above for the equation y + y~^ = 0. 

14.3. The Thomas Fermi equation is given hy y = . 

(a) Verify that it is not invariant upon the discrete symmetry trans¬ 
formation of time reversal, 

t = (—1)"^ for integer n. 

(b) Verify that it is not invariant upon the discrete symmetry trans¬ 
formation of parity, 

y ^ y = {—l)'^y for integer n. 

(c) Verify that it is invariant upon the discrete symmetry transfor¬ 
mation 

and y^y = {-l)''y. 

14.4. Find the prolongation of the infinitesimal generator 

U = ^dt + rjdy 

acting on the Lagrangian 

^ 1 9 1 9 

Write your answer in terms of ^ and rj but not rj^ or 

14.5. Find the infinitesimal generators for the equation, y + y~^ = 0. (This 
problem is discussed in [190].) 

Answer: 

Ui=dt 

U 2 = 2tdt -f ydy 
U 3 = t^dt -f tydy 
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14.6. The equation y + y~^ = 0 has the three infinitesimal generators fisted 
in the problem above. These infinitesimal generators form a group. 
The commutator was defined in Section 14.3.3, and the commutator 
of any pair of these infinitesimal generators can be calculated by 


[UaM=UaU,-U^Ua. 

Using the equation above, show that the commutator for each of the 
three pairs of infinitesimal generators results in another element of 
the group. 

14.7. Derive the infinitesimal generators for the wave equation, y+u^y = 0. 
(This problem is discussed in [191].) 

Answer: 

Ui = dt 
U2 = ydy 
7/3 = sin {uot) dy 
7/4 = cos {uot) dy 

7/5 = sin(2a;o7)i9t + Uoy cos{2uot)dy 
Uq = cos{2uot)dt — Uoy sm{2uot)dy 
7/7 = 1 / cos {uot) dt - uoy"^ sin {uot) dy 
Us = ysin {uot) dt + uoy'^ cos {uot) dy 

14.8. The wave equation y + u^y = 0 has the eight infinitesimal generators 
listed in the problem above. The corresponding Lagrangian is 

r _ ^ -2 1 2 2 

^ - 2 ^ ■ 

Find the invariants corresponding to the following infinitesimal gen¬ 
erators. 

(a) 7/1 = dt 

(b) 7/3 = sin {uot) dy 

(c) 7/5 = sin(2c<;o/)i9t + uoy cos(2uot)dy 
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14.9. In Problem 11.8, we encountered the equation given by 1/ = gsiny 
for constant g. 

(a) Show that U = dt is an infinitesimal generator of this equation. 

(b) Show that U = ydy is not an infinitesimal generator of this 
equation. 

14.10. The Lagrangian 

^ 1 9 1-9 

corresponds to the equation of motion y + y~^ = 0 . This equation 
of motion has three infinitesimal generators: 

Ui = dt 

U 2 = 2tdt + ydy 

U3 = fdt + tydy 

Use Noether’s theorem to find the invariants that correspond to each 
of these infinitesimal generators. (We encountered this Lagrangian in 
problem 11.3.) 
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Appendices 

Appendix A: Variable List 

Vectors are denoted V, and unit vectors are denoted v. In the third column, 
an asterisk * indicates that the units vary depending on the context. In 
the fourth column, S = scalar, V = vector, C =constant, F = functional, 
and O = operator. Constants are specified to four significant figures. 


Symbol 

Quantity 

SI Units 

Scalar? 

Comments 

az 

Cartesian 
coordinate unit 

vectors 

unitless 

V 



Spherical 
coordinate unit 

vectors 

unitless 

V 


't 

Acceleration 

m 

S2 

V 


A 

Cross sectional 

area 

m^ 

S 


Ai2 

Einstein A 
coefficient 

s-i 

s 


^aff 

Electron affinity 

J 

atom 

s 


1) 

Pyroelectric 

coefficient 

C 

m2.K 

V 


Bi2i B 21 

Einstein B 

coefficient 

m3 

.I.S2 

s 


t 

Magnetic flux 
density 

Wb 

m2 

V 


B 

Bulk modulus 

N 

nU 

s 


c 

Speed of light in 
free space 

m 

s 

c 

= 2.998 ■ 10® 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 

Cn 

Coefficient 

unitless 

S 

Eor integer n 

c 

Capacitance 

E 

S 


c. 

Specific heat 

J 

kg.K 

s 


d 

Piezoelectric strain 

constant 

m 

V 

s 

May be a 
scalar or 
matrix 

dthick 

Thickness 

m 

s 


D 

Directivity 

unitless 

s 

Eor antennas 


Displacement fiux 
density 

C 

m2 

V 


e 

e 

unitless 

c 

^2.718 

e~ 

Electron 



Used in chem. 
reactions 

E 

Energy 

J 

s 



Electric field 
intensity 

V 

m 

V 


Ef 

Eermi energy level 

J 

s 

Also called 

chemical 

potential 

E, 

Energy gap 

J 

s 

Also called 
bandgap 

f 

Erequency 

Hz 

s 


f 

Eorce 

N 

V 


9 n 

Degeneracy of 
energy level n 

unitless 

s 


G 

Component of an 
invariant 

J 

s 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 

h 

Planck constant 

J-s 

C 

= 6.626 ■ 10-34 

H 

Hamiltonian 

J 

F 


Hqm 

Quantum 

Mechanical 

Hamiltonian 

J 

0 


t 

Magnetic field 
intensity 

A 

m 

V 


[Hi 

Amount 
concentration 
hydrogen ions 

mol 

L 

s 


i 

Current (AC or 
time varying) 

A 

s 


I 

Current (DC) 

A 

s 


I 

Moment of inertia 

kg ■ m^ 

s 


^ioniz 

Ionization energy 

J 

atom 

s 


j 

Imaginary number 

unitless 

c 


J 

Volume current 
density 

A 

m2 

V 


k 

Wave vector 

m“^ 

V 


ks 

Boltzmann 

constant 

J 

K 

c 

= 1.381 ■ 10-23 

kf 

Fermi wave vector 

m“^ 

s 


K 

Spring constant 

J 

m2 

s 


K 

Torsion spring 
constant 

J 

rad^ 

s 


1 

Length 

m 

s 


L 

Inductance 

H 

s 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 

Lam 

Angular 

momentum 

J-s 

V 


c 

Lagrangian 

J 

F 


m 

Mass 

kg 

S 


M 

Generalized 

momentum 

* 

S 

Many authors 
use p 

dl 

Momentum 

kg-m 

S 

V 

Many authors 

use 

Mqm 

Quantum 

mechanical 

momentum 

kg-m 

S 

v,0 


n 

Concentration of 
electrons 

m 

s 


n 

Index of refraction 

unitless 

s 


n 

Integer 

unitless 

s 


N 

Total number of e~ 
per atom 

electrons 

atom 

s 


Na 

Avogadro constant 

molecule 

mol 

c 

= 6.022 ■ 10^3 

N, 

Number of valence 

e~ 

electrons 

s 


n 

Number of moles 

mol 

s 


p 

Concentration of 
holes 

m 

s 


p 

Power 

w 

s 



Material 

polarization 

c 

m2 

V 


p 

Pressure 

Pa 

s 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 

Q 

Magnitude of 
electron charge 

C 

C 

= 1.602 ■ 10-^9 

Q 

Charge 

C 

S 


Q 

Heat 

J 

S 


r 

Distance in 

spherical 

coordinates 

m 

s 



Position in 

spherical 

coordinates 

m 

V 


r 

Distance in 
reciprocal space 

m“^ 

s 


R 

Resistance 

Q 

s 


Rh 

Hall resistance 

Q 

s 


M 

Molar gas constant 

J 

c 

= 8.314 

mol K 

R 

Mirror reflectivity 

unitless 

s 


s 

Kerr coeflficient 

m2 

s 


$ 

Seebeck coeflficient 

V 

K 

s 



Entropy 

J 

K 

s 


§ 

Action 

J 

E 


t 

Time 

s 

s 


T 

Temperature 

K 

s 


u 

Energy density per 
unit bandwidth 

J-s 

nU 

s 


U 

Inflnitesimal 

generator 

unitless 

0 


u 

Internal energy 

J 

s 





346 


Appendices 


Symbol 

Quantity 

Units 

Scalar? 

Comments 

V 

Velocity 

m 

s 

V 


V 

Voltage (AC or 
time varying) 

V 

S 


V 

Voltage (DC) 

V 

S 


V 

Volume 


s 


K) 

Contact potential 
of pn junction 

V 

s 



Redox potential 

V 

s 


^cell 

Cell potential 

V 

s 

Many authors 

use or 

w 

Mechanical work 

J 

s 



Positional 

displacement 

m 

V 


y 

Dependent variable 
of equation 

* 

s 


y 

Shorthand for total 
derivative ^ 

* 

s 


z 

Figure of merit 

K-i 

s 


Zo 

Characteristic 

impedance 

Q 

s 


a 

Absorption 

coefficient 

m“^ 

s 


7 

Pockels coefficient 

m 

V 

s 


A 

Delta (change in) 

unitless 

0 


e 

Permittivity 

F 

m 

s 


eo 

Permittivity of free 
space 

F 

m 

c 

= 8.854-10-^2 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 


Relative 

permittivity 

unitless 

S 


e 

Infinitesimal 

parameter 

unitless 

S 


7] 

Transformation of 
dependent variable 

* 

s 


Veff 

Efficiency 

unitless 

s 


e 

Angle (Elevation) 

rad 

s 


t 

Angular 

displacement 

vector 

rad 

V 


K 

Thermal 

conductivity 

W 
m K 

s 


X 

Wavelength 

m 

s 


fi 

Permeability 

H 

m 

s 


f^o 

Permeability of 
free space 

H 

m 

c 

= An ■ 10“^ 

= 1.257- 10“^^ 

jJ,J. 

Relative 

permeability 

unitless 

c 


l^chem 

Chemical potential 

J 

atom 

s 

Also known as 
Eermi energy 
level 

/^n 

Mobility of 
electrons 

m2 

V-s 

s 


f^p 

Mobility of holes 

m2 

V-s 

s 


e 

Transformation of 

independent 

variable 

* 

s 


n 

Peltier coefficient 

V 

s 
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Symbol 

Quantity 

Units 

Scalar? 

Comments 

P 

Resistivity 

Um 

S 


Pch 

Charge density 

C 

m3 

S 


Pdens 

Mass density 

kg 

m3 

s 


a 

Electrical 

conductivity 

1 

om 

s 



Stress 

Pa 

V 


T 

Thomson 

coefficient 

V 

K 

s 



Torque 

N-m 

V 



Angle (Azimuth) 

rad 

s 


T 

Invariant 

* 

s 


Xe 

Electric 

susceptibility 

unitless 

s 


X 

Electronegativity 

J 

atom 

s 



Wave function 

unitless 

s 



Magnetic flux 

Wb 

s 


u 

Erequency 

rad 

s 

s 


^ang 

Angular velocity 

rad 

s 

V 


h 

Planck constant 
divided by 2tt, also 
called hbar 

J-s 

c 

= 1.055 ■ 10-34 


Gradient operator 


0 

Also called del 

dt 

Shorthand for 
partial derivative 

JL 

dt 


0 





Appendices 


349 


Appendix B: Abbreviations of Units of Measure 

Common abbreviations for units of measure are listed in Table 14.2. This 
table does not cover all units used in this text. Further measures of energy 
and power are discussed in Section 1.4. For further information, see [68]. 
The fourth column indicates whether the unit is an SI base unit, an SI 
derived unit, or not an SI unit. Table 14.3 lists prefixes used with SI units 
| 193 |. 


Abbreviation 

unit 

Measure 

SI unit? 

A 

ampere 

Current 

Base 

cd 

candela 

Luminous intensity 

Base 

C 

coulomb 

Charge 

Derived 

°C 

degree Celsius 

Temperature 

Derived 

d 

day 

Time 

No 

eV 

electronvolt 

Energy 

No 

F 

farad 

Capacitance 

Derived 

°F 

degree Fahrenheit 

Temperature 

No 

H 

henry 

Inductance 

Derived 

Hz 

hertz 

Frequency 

Derived 

h 

hour 

Time 

No 

J 

joule 

Energy 

Derived 

K 

kelvin 

Temperature 

Base 

kg 

kilogram 

Mass 

Base 

L 

liter 

Volume 

No 

m 

meter 

Length 

Base 

mol 

mole 

Amount of substance 

Base 

N 

newton 

Eorce 

Derived 

Pa 

pascal 

Pressure 

Derived 

rad 

radian 

Angle 

Derived 

s 

second 

Time 

Base 

V 

volt 

Voltage 

Derived 

W 

watt 

Power 

Derived 

Wb 

weber 

Magnetic flux 

Derived 

n 

ohm 

Resistance 

Derived 


Table 14.2: Units and their abbreviations. 
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Prefix name 

Symbol 

Value 

yotta 

Y 

1024 

zetta 

Z 

1024 

exa 

E 

o 

00 

peta 

P 

1045 

tera 

T 

1042 

giga 

G 

104* 

mega 

M 

10® 

kilo 

k 

lO^ 

milli 

m 

o 

1 

CO 

micro 

/r 

10-5 

nano 

n 

10-4* 

pico 

P 

10-42 

femto 

f 

10-45 

atto 

a 

o 

1 

00 

zepto 

z 

10-24 

yocto 

y 

o 

1 

to 

►k- 


Table 14.3: Prefixes used with SI units. 
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Appendix C: Overloaded Terminology 

Physicists, chemists, electrical engineers, and other scientists develop their 
own notation to describe physical phenomenon. However, a single word 
may be adopted with different meanings by scientists studying different 
disciplines. In this section, some of these overloaded terms are discussed. 

In general, the term polarization means splitting into distinct opposite 
parts. In this text, two types of polarization are discussed: material po¬ 
larization and electromagnetic polarization. If an external electric field, a 
voltage, is placed across a piece of material it will affect the material. If 
the material is at a temperature other than absolute zero, the electrons are 
in constant motion. However, the overall electron location will shift when 
the external electric field is applied. The term material polarization refers 
to the fact that when an external voltage is applied across an insulator, the 
electrons slightly displace from the nucleus, so the atom is more negatively 
charged on one end and positively charged on the other. Material polar¬ 
ization is discussed beginning in Section 2.2.1. The other use of the term 
polarization describes how electromagnetic waves vary with time as they 
propagate through space. Electromagnetic polarization specifies the direc¬ 
tion of the electric field with respect to the direction of propagation of an 
electromagnetic plane wave. It is discussed in Section 4.4.4. A propagat¬ 
ing electromagnetic field may be classified as linearly polarized, circularly 
polarized, or elliptically polarized. To determine the electromagnetic po¬ 
larization of a plane traveling wave, project the electric field ^(t) onto a 
plane perpendicular to the direction that the wave is traveling. If the re¬ 
sulting projection is a straight line, the wave is said to be linearly polarized. 
If the projection is a circle, the wave is said to be circularly polarized, and 
if the projection is an ellipse, the wave is said to be elliptically polarized. 

Another overused term is inversion. Inversion symmetry is discussed 
in Section 2.3.2, and population inversion is discussed in Section 7.2.4. If a 
crystal structure looks the same upon rotation or reflection, the structure is 
said to have a symmetry. If the crystal structure looks the same after 180° 
rotation and inversion through the origin, the structure is said to have in¬ 
version symmetry. This idea is illustrated in Fig. 2.8. The term population 
inversion is defined in the context of lasers. In a laser, LED, lamp, or other 
device that converts electricity to optical energy, a pump excites electrons 
from a lower to a higher energy level. A population inversion occurs when 
more electrons of the active material are in the upper, rather than lower, 
energy level. Lasing requires a population inversion. 

The word potential is also quite overloaded. Electrical engineers some¬ 
times use potential, as well as the term electromotive force, as a synonym 
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for voltage. (As an aside, reference [6] carefully distinguishes between these 
three terms.) In this context, potential, like voltage, has the units volts. 
The term chemical potential Uchem has units of joules per atom, and it 
represents energy where the probability of finding an electron is one half. 
For a pure semiconductor, the chemical potential is in the middle of the 
energy gap. Semiconductor scientists typically use the term Fermi energy 
level Ef instead. These terms are discussed in Sec. 6.3.3 and 9.2.3. Volt¬ 
age times charge is energy, so chemical potential can be thought of as a 
voltage times the charge of an electron. The term redox potential Vrp is 
equivalent to the term voltage used by electrical engineers, and it has the 
units volts. It was introduced in Sec. 9.2.5. It is used, typically by exper¬ 
imentalists, in discussing the voltage that develops across electrodes due 
to oxidation reduction chemical reactions. In the discussion of calculus of 
variations in Chapter 11, the idea of generalized potential was introduced. 
It is a parameter used to describe the evolution of an energy conversion 
process. Voltage and chemical potential can both be examples of gener¬ 
alized potentials. Generalized potential has units of joules over the units 
of the generalized path. The choice of the word generalized potential in 
calculus of variations follows reference [194, p.11-19]. Another related term 
is potential energy. Potential energy is a form of energy, and it is measured 
in joules. If we raise an object against gravity, we say that the object gains 
potential energy, and if we compress a spring, we say the spring gains spring 
potential energy. 

The related words capacitor, capacitance, theoretical capacity, and gen¬ 
eralized capacity are used in this text. A capacitor is one of the most 
common circuit components, and capacitors are discussed in Sec. 1.6.3. A 
capacitor is a device constructed from conductors separated by a dielectric 
layer. It is specified by a capacitance C, in farads, which is a measure of 
the ability of the device to store a built up charge, hence store energy. The 
permittivity e describes the distributed capacitance, in jjj, of an insulating 
material. As discussed in Sec. 9.4.1, chemists use the related term the¬ 
oretical capacity in a different way, as a measure of the charge stored in 
an battery or fuel cell. It is measured in coulombs or ampere hours. The 
adjective theoretical refers to the total amount of charge stored, not the 
charge that can be practically extracted. The idea of generalized capacity 
was introduced in Sec. 12.2 as the general ability to store energy. As with 
other concepts of calculus of variations, the units of generalized capacity 
depend on the choice of generalized path and generalized potential. 

Conductivity describes ability of some particles to flow. Electrical con¬ 
ductivity a describes the ability of charges to flow. It was introduced in Sec. 
1.6.3 and discussed further in Sec. 8.6.2 and 9.2.1. Thermal conductivity 
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K describes the ability of heat to flow, and it was introduced in Sec. 8.6.3. 
Both of these ideas were discussed in the context of thermoelectric devices 
because understanding these devices requires the understanding of the flow 
of both electrons and heat. Example values for electrical conductivity and 
thermal conductivity are given in Table 8.4. 

While not identical, the terms wave vector and wave function are worth 
distinguishing. The term wave vector was introduced in Sec. 6.4 with the 
idea of reciprocal space. Functions such as energy or charge density can be 
described as varying with respect to position specifled by the vector in 
units of meters. These functions can also be described as varying with re¬ 
spect to spatial frequency specifled by the wave vector k in units A-. Wave 
function tjj was introduced in Section 11.7, and it is a fundamental idea of 
quantum mechanics. The wave function is a measure of the probability of 
flnding an electron or other quantity in a particular state. 

Appendix D: Specific Energies 

Table 14.4 lists the speciflc energy of various materials and devices. These 
are representative values, not values for speciflc devices. Batteries by differ¬ 
ent manufacturers, for example, will have a range of speciflc energy values, 
and these values are often detailed in a datasheet. See the listed references 
for additional information on the assumptions made. Two types of values 
are listed for batteries. Both theoretical speciflc energy values for the chem¬ 
ical reactions and speciflc energy values for practical devices. The notation 
(th) indicates theoretical values while (pr) indicates practical values. NMH 
is an abbreviation for Nickel Metal Hydride batteries. 
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Material or Device 

Specific 

Energy in g 

Specific 

Energy in 

W-h 

kg 

Ref. 

Uranium 

6.77-101° 

1.88 - 10i° 

[3] 

Hydrogen 

1.18 ■ 10^ 

3.28 - lO^ 

| 195 | 

Gasoline 

4.64 ■ IQi 

1.29 - lO^ 

| 195 | 

Petroleum (crude) 

4.4 ■ IQi 

1.2 - 10^ 

[1] 

Coal (high quality) 

3.4 ■ 10^ 

9.4-10° 

[1] 

Methanol 

2.19 ■ IQi 

6.08 - 10° 

| 196 | 

Ammonia 

2.00 ■ 10^ 

5.56 - 10° 

| 195 | 

Coal (low quality) 

1.6 ■ lO^ 

4.5 - 10° 

[1] 

Sugar 

1.57-10^ 

4.36 - 10° 

[196] 

Hydrogen oxygen fuel 
cell (th) 

1.32 ■ lO^ 

3.66 - 10° 

| 128 | 

Lithium ion battery (th) 

1.61 ■ 10° 

448 

| 128 | 

Alkaline battery (th) 

1.29 ■ 10° 

358 

| 128 | 

Lead acid battery (th) 

9.1 ■ 10^ 

252 

| 128 | 

NMH battery (th) 

8.6 ■ 102 

240 

| 128 | 

Lithium ion battery (pr) 

7.2 ■ lO^ 

200 

| 128 | 

Alkaline battery (pr) 

5.54-102 

154 

| 128 | 

NMH battery (pr) 

3.6 -102 

100 

| 128 | 

Lead acid battery (pr) 

1.3 -102 

35 

| 128 | 

Rubber band 

7.9 

2.2 

| 195 ] 


Table 14.4: Specific energy of various materials and devices. For batteries, 
(th) indicates theoretical values, and (pr) indicates practical values. 
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About the Book 


Direct Energy Conversion discusses both the physics behind energy con¬ 
version processes and a wide variety of energy conversion devices. A direct 
energy conversion process converts one form of energy to another through 
a single process. The first half of this book surveys multiple devices that 
convert to or from electricity including piezoelectric devices, antennas, so¬ 
lar cells, light emitting diodes, lasers, thermoelectric devices, and batteries. 
In these chapters, physical effects are discussed, terminology used by engi¬ 
neers in the discipline is introduced, and insights into material selection is 
studied. The second part of this book puts concepts of energy conversion 
in a more abstract framework. These chapters introduce the idea of cal¬ 
culus of variations and illuminate relationships between energy conversion 
processes. 

This peer-reviewed book is used for a junior level electrical engineering 
class at Trine University. However, it is intended not just for electrical 
engineers. Direct energy conversion is a fascinating topic because it does 
not fit neatly into a single discipline. This book also should be of interest to 
physicists, chemists, mechanical engineers, and other researchers interested 
in an introduction to the energy conversion devices studied by scientists and 
engineers in other disciplines. 
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